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REPORT  ON  THE  CONGRESS 
Josef  Singer 


The  decision  to  accept  the  invitation  from  Israel  to  hold  the  XVHIth  International  Congress 
in  Haifa  was  taken  by  the  Congress  Committee  of  IUTAM  during  its  meeting  in  Grenoble  in 
August  1988.  It  was  decided  that  the  format  of  the  Congress  would  follow  the  one  adopted  for 
the  recent  successful  Congresses  in  Lyngby  and  Grenoble.  The  Congress  would  cover  the 
entire  field  of  mechanics  with  special  emphasis  on  three  selected  topics  to  constitute  the  so- 
called  minisymposia. 

The  Congress  Committee  selected  the  following  three  topics  for  these  minisymposia: 

1 .  Instabilities  in  Solid  and  Structural  Mechanics. 

2 .  Sea  Surface  Mechanics  and  Air-Sea  Interaction. 

3 .  Biomechanics. 

The  Congress  Committee  also  selected  two  general  lecturers:  Professor  Anatol  Roshko 
(USA)  to  present  the  Opening  Lecture  and  Professor  G.I.  Barenblatt  (Russia)  to  present  the 
Closing  Lecture;  as  well  as  15  Sectional  Lectures.  The  chairmen  of  the  Minisymposia  further 
selected  9  Introductory  Lectures  for  their  symposia. 

As  in  Lyngby  and  Grenoble,  the  contributed  papers  were  presented  in  parallel  sessions, 
either  as  25  minute  lectures  or  in  poster- sessions  which  were  scheduled  separately  from  the 
lectures.  There  were  lively  d;scussions  both  after  the  lectures  and  in  the  poster-sessions,  where 
the  second  half  of  the  sessions  were  devoted  to  general  discussions  guided  by  the  chairmen. 

The  Opening  Session  of  the  Congress  was  held  in  the  Churchill  Auditorium  of  the 
Technion  at  10  o'clock  on  Sunday,  23rd  August,  1992.  The  Session  was  opened  by  Professor 
Josef  Singer,  Chairman  of  the  Local  Organizing  Committee  of  ICTAM  '92,  with  the  following 
words: 

"Distinguished  Members  of  the  Dais,  Ladies  and  Gentlemen, 

It  gives  me  great  pleasure  to  open  the  18th  International  Congress  of  Theoretical  and 
Applied  Mechanics  and  to  greet  this  outstanding  gathering  of  scientists  and  engineers  which 
will  ensure  the  success  of  the  Congress. 

Welcome  to  Israel,  to  Haifa  and  to  the  Technion,  Israel  Institute  of  Technology. 

Let  me  also  greet  you  in  Hebrew  (welcome  to  participants). 

I  think  you  will  understand  me  better  if  I  do  not  continue  in  Hebrew.  Maybe  by  the  time  we 
come  to  the  Closing  Session,  your  Hebrew  will  be  good  enough  to  permit  me  to  address  you 
all  in  our  language. 

So  let  me  introduce  you  to  the  Dais: 

I  will  start  at  the  far  end  of  the  Head  Table.  There  are  the  Deans  of  the  two  Technion 
Faculties  which  are  most  active  in  Mechanics: 

Professor  Blech,  Acting  Dean  of  the  Faculty  of  Mechanical  Engineering,  and  Professor 
Shinar,  Dean  of  the  Faculty  of  Aerospace  Engineering. 

Next,  the  Secretary-General  of  IUTAM,  Professor  Schielen  of  Stuttgart  University,  who  is 
entrusted  with  the  many  IUTAM  Symposia  and  other  IUTAM  activities. 
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To  his  left,  the  President  of  the  Israel  Society  for  Theoretical  and  Applied  Mechanics, 
Professor  Hashin  of  the  Faculty  of  Engineering  of  Tel  Aviv  University,  an  active  member  of 
our  Organizing  Committee, 

Next,  the  Secretary  of  the  IUTAM  Congress  Committee,  Professor  Moffatt  of  Cambridge 
University,  whose  guidance  and  tireless  efforts  have  been  essential  to  the  success  of  the 
scientific  program  and  the  preparations  of  the  Congress. 

To  his  left,  the  Mayor  of  our  beautiful  city,  Haifa,  Mr.  Gurel,  who  graciously  supports  our 
meeting  and  many  other  international  activities. 

Then  our  IUTAM  President,  Professor  Germain,  of  the  French  Academy  of  Science, 
whom  many  of  you  also  remember  as  the  Organizer  of  the  successful  Congress  in  Grenoble  in 
1988. 

To  his  left,  Professor  Paul  Singer,  Senior  Vice  President  of  Technion,  who  will  soon  bring 
to  you  the  greetings  of  the  Technion  President. 

Then  the  IUTAM  Vice  President,  Sir  James  Lighthill,  the  past  IUTAM  President  and 
former  Vice  Chancellor  of  University  College  London. 

To  his  left.  Professor  Bodner  of  our  Faculty  of  Mechanical  Engineering,  the  Co-Chairman 
of  our  Local  Organizing  Committee,  who  gave  his  time  and  knowledge  untiringly  and  thus 
contributed  so  much  to  our  Congress. 

Then  the  Treasurer  of  IUTAM,  Professor  van  Wijngaarden  of  the  University  of  Twente  in 
The  Netherlands,  whose  importance  is  self-evident. 

And  last,  but  certainly  not  least,  the  wonderful  Secretary  of  our  Local  Organizing 
Committee,  Professor  Alex  Solan,  Technion  Vice  President  for  Academic  Affairs,  with  whom 
most  of  you  have  already  corresponded  and  without  whose  outstanding  and  never  ending  work 
we  could  not  have  carried  out  the  preparations  for  the  Congress. 

ICTAM  Participants,  as  I  look  around  me  I  see  many  friends,  the  top  scientists  of  the 
international  mechanics  community,  the  international  mechanics  family.  I  am  sure  that  our 
deliberations  will  not  only  advance  our  field,  but  will  also  weave  many  new  collaborations  and 
friendships. 

For  the  benefit  of  the  delegates  who  are  new  to  ICTAM,  I  would  like  to  ask  some  of  the 
former  IUTAM  Presidents  to  please  rise:  Professor  Frithiof  Niordson  of  Denmark,  and 
Professor  Daniel  Drucker  of  U.S.A. 

To  finish  the  introductions,  it  gives  me  great  pleasure  to  see  here  among  us,  my  former 
teacher  and  good  friend.  Professor  Nicholas  Hoff  of  Stanford  University,  who  taught  me  the 
meaning  of  theoretical  and  applied  mechanics  and  much  more,  and  whom  many  of  you  will 
remember  as  the  President  and  Organizer  of  our  1968  Congress  at  Stanford.  Please  rise 
Nicholas. 

Friends,  I  may  have  taken  up  some  time  with  these  introductions,  but  I  remember  that  at 
the  first  ICTAM  I  attended  in  Stresa  in  1960,  seeing  the  great  names  in  the  flesh  was,  indeed,  a 
great  experience! 

Ladies  and  Gentlemen,  we  Israelis  are  honored  and  happy  to  host  the  18th  International 
Congress  of  Theoretical  and  Applied  Mechanics  in  our  old-new  Land. 

Though  the  ancient  Israelis  are  better  known  for  their  Monotheism  than  for  their 
achievements  in  mechanics,  one  finds  that  they  tried  their  hand  in  some  large  scale  experiments 
in  mechanics. 
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For  example,  in  the  Book  of  Exodus  we  hear  that  in  their  flight  from  Egypt,  the  Children 
of  Israel  (with  some  assistance  from  the  Almighty)  experimented  with  sea-air  interactions,  for 
we  read  in  Exodus  14/21,22: 

"And  Moses  stretched  out  his  hand  over  the  sea  and  the  Lord  caused  the  sea  to  go  back  by 
a  strong  east  wind . and  made  the  sea  dry  land  and  the  waters  were  divided". 

"And  the  Children  of  Israel  went  into  the  midst  of  the  sea  upon  the  dry  ground,  and  the 
waters  were  a  wall  unto  them  on  their  right  and  on  their  left”. 

Would  this  not  Fit  into  our  second  Minisymposium?! 

Or,  in  the  Book  of  Joshua,  we  hear  of  an  early  experiment  in  the  dynamics  of  sound 
waves,  for  we  read  in  Joshua  6/5: 

"And  it  shall  come  to  pass,  that  when  they  make  a  long  blast  with  a  ram's  horn,  and  when 
ye  hear  the  sound  of  the  trumpet,  all  the  people  shall  shout,  and  the  wall  of  the  city  shall  fail 
down  flat". 

So,  there  was  some  mechanics  activity  here  3,500  years  ago! 

In  modem  times,  Israel  has  a  small  but  very  active  mechanics  community,  whose  presence 
and  achievements  are  known  internationally,  and  whose  participation  in  IUTAM  Congresses 
and  Symposia  has  been  very  significant 

At  the  Technion  we  were  fortunate  to  have  had  Professor  Marcus  Reiner,  well  known  as 
one  of  the  Fathers  of  Rheology,  active  here  from  1947-1976.  He  built  a  thriving  department  of 
mechanics,  that  later  diffused  into  other  departments. 

In  1961  Professor  Reiner  organized  an  IUTAM  Symposium  on  "Secondary  Effects  in 
Elasticity,  Plasticity  and  Fluid  Dynamics",  in  Haifa  (by  the  way,  my  wife  worked  with  him  on 
the  organization  of  that  symposium)  a  symposium  which  was  the  occasion  of  the  first  visit  to 
Israel  for  some  of  you. 

In  1985  Professors  Bodner  and  Hashin  organized  another  IUTAM  Symposium  on 
"Mechanics  of  Damage  and  Fatigue",  at  Technion,  which  was  also  very  successful. 

Hence,  an  ICTAM  at  Haifa  seem  to  follow  logically.  Well,  some  of  us  in  the  audience  have 
aged  a  bit  since  the  first  time  Israel  offered  to  host  ICTAM  many  years  ago,  but  I  am  happy 
that  we  all  made  it  finally! 

We  have,  I  believe,  an  excellent  scientific  program  thanks  to  you  the  contributors  and  to  the 
excellent  work  of  our  International  Papers  Committee.  As  you  know,  the  choice  was  difficult 
and  the  600  papers  chosen  out  of  1,183  submitted  originated  in  48  different  countries. 
However,  as  we  had  many  and  continuous  changes,  even  in  the  number  of  countries,  the 
program  you  have  could  be  finalized  only  two  weeks  ago. 

The  flags  on  the  platform  represent  all  the  countries  from  which  papers  were  accepted.  The 
flags  indicate  origins  of  papers,  but  we  are  all  here  as  individuals,  contributing  to  theoretical 
and  applied  mechanics.  Individuals  whose  friendship  and  collaboration  will  be  reinforced  by 
this  Congress  and  who  will  assist  in  bringing  our  nations  closer  and  improve  their  relations. 

Before  closing.  I  would  like  to  thank  the  IUTAM  Congress  Committee,  and  in  particular  its 
Executive  Committee,  the  IUTAM  Bureau,  the  International  Papers  Committee,  as  well  as  the 
different  National  Committees.  I  will  express  my  gratitude  to  my  local  collaborators,  who 
worked  so  hard  to  prepare  the  Congress,  in  more  detail  at  the  Closing  Session.  For  now,  only 
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sincere  thanks  to  you,  my  colleagues  of  the  Local  Organizing  Committee  and  to  the  devoted 
staff  of  the  Technion  and  Kenes  teams  who  will  also  look  after  us  in  the  coming  days. 

I  would  also  like  to  thank  the  Israel  Academy  of  Sciences  and  Humanities,  the  various 
Israeli  government  ministries,  our  municipality,  and  the  Haifa  Tourist  Board,  the  universities 
and  industrial  companies,  whose  names  appear  on  the  program  and,  in  particular,  the  U.S.  Air 
Force  European  Office  of  Aerospace  Research  and  Development,  and  the  U.S.  Army  European 
Research  Office,  for  their  generous  support 

As  you  know,  the  Israel  Academy  is  one  of  the  sponsors  of  our  Congress.  Its  President, 
Professor  Joshua  Jortner,  who  is  abroad,  sent  us  his  greetings  which  I  would  like  to  read  to 
you: 

"The  Israel  Academy  recognizes  the  important  contribution  international  meetings  make  to 
the  advancement  and  excellence  of  research,  by  providing  a  forum  for  direct  contact  between 
scientists  and  engineers  engaged  in  high  quality  scientific  endeavor. 

I  am  confident  that  the  18th  International  Conference  in  Theoretical  and  Applied  Mechanics 
will  contribute  to  the  advancement  and  the  enhanced  cooperation  in  these  exciting  and 
important  fields  of  scientific  and  technological  research.  The  integration  of  science  and 
technology  which  is  so  well  reflected  in  your  Conference  is  of  prime  importance  for  Israel  and 
for  the  international  community  at  large. 

Please  convey  my  compliments,  on  behalf  of  the  Israel  Academy  of  Sciences  and 
Humanities,  to  all  the  participants. 

I  wish  you  a  fruitful  and  stimulating  meeting." 

Ladies  and  Gentlemen,  I  concur  with  Professor  Jortner,  and  wish  us  all,  delegates  and 
accompanying  persons,  a  fruitful  and  enjoyable  Congress,  and  a  very  pleasant  stay  in  Israel. 
Thank  you." 

Professor  Singer  then  called  on  Professor  Solan,  the  Secretary  of  the  Local  Organizing 
Committee,  to  introduce  the  speakers  who  followed. 

Mayor  Gurel  then  brought  the  greetings  of  the  Municipality  of  Haifa  to  the  Congress 
participants  and  wished  them  successful  deliberations  and  a  pleasant  stay  in  Haifa  and  Israel. 

Since  the  President  of  the  Technion  was  abroad,  the  Senior  Vice  President,  President  Paul 
Singer,  greeted  the  assembly  with  the  following  words: 

"Distinguished  Guests  and  Friends, 

It  is  my  privilege  and  honor  to  welcome  you,  participants  of  the  International  Congress  of 
Theoretical  and  Applied  Mechanics  to  the  Technion  -  Israel  Institute  of  Technology,  with  the 
traditional  Hebrew  greeting  -  "Blessed  be  those  who  came”.  -  This  Congress  is  the  XVIII  th  in 
a  series  which  started  in  Delft,  Netherlands  and  carries  with  it  by  now  a  most  famous 
tradition.  We  are  therefore  very  proud  and  consider  it  a  great  distinction,  that  for  the  first  time 
the  Congress  convenes  in  Israel,  and  the  Technion  Management  is  especially  proud  that  our 
scientists  have  been  entrusted  with  the  task  of  organizing  and  hosting  this  event. 

But,  there  is  another  "FIRST"  which  I  would  like  to  point  out  to  you.  A  perusal  of  the 
previous  sites  shows  that  you  are  meeting  for  the  first  time  at  an  altitude  of  several  hundred 
meters  above  the  sea  level,  - 1  am  confident  this  will  bring  to  this  Meeting  'Scientific  heights” 
previously  unknown.  You  have  certainly  chosen  a  very  unique  location:  Mount  Carmel  (on  one 
of  its  hills  we  are  here)  -  was  mentioned  already  a  few  thousand  years  ago.  For  instance,  in  the 
Book  of  Jeremiah  one  finds  "To  the  CARMEL  by  the  sea,  -  one  comes".  This  mountain  is 
perceived  and  mentioned  from  very,  very  old  times  as  a  "SAFE  SHORE”.  The  ancient 
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Egyptians  used  to  call  it  "The  Holy  Summit"  and  the  famous  Egyptian  Emperor  (Pharaoh) 
Ramses  the  Second,  nicknamed  it  in  the  13th  Century  B.C.,  "The  Mountain  of  Strength". 
Possibly  the  most  famous  personality  associated  with  it  is  the  Prophet  Elijah  who  used  to 
prophesize  and  build  his  favorite  models  and  theories  and  indoctrinate  his  followers  right  here 
on  these  hills.  I  mention  this,  since  in  the  Ninth  Century  B.C.,  during  the  reign  of  King  Ahab 
of  Israel,  a  very  famous  Assembly  was  organized  by  the  Prophet  Elijah  on  Mount  Carmel  in 
the  presence  of  the  King,  possibly  the  precursor  of  the  subsequent  scientific  conferences  held 
on  this  mountain.  About  850  people  attended  that  famous  Assembly,  not  very  different  from 
the  attendance  of  this  Congress,  at  which  some  ideological  differences  (theories  or  approaches 
in  our  present  language)  were  confronted.  Well,  here  die  parallel  stops,  since  meetings  in  those 
times  could  have  very  violent  endings  as  happened  in  the  Assembly  convened  by  King  Ahab. 

Interestingly,  the  location  of  Prophet  Elijah's  home-cave  is  determined  to  be  in  two 
different  places,  some  300  meters  apart,  by  the  Christian  and  Jewish  faith.  Given  the  distance 
of  nearly  3000  years  on  the  time  scale,  this  is  pretty  good  accuracy. 

Referring  anew  to  your  Congress  series,  I  was  struck  by  an  interesting  historical 
coincidence.  The  first  ICTAM  took  place  in  DELFT  in  1924,  and  in  the  very  same  year  our 
University,  the  Technion,  opened  its  gates  for  business,  enrolling  its  first  14  students  with  7 
Faculty  members  (a  pretty  good  ratio  we  did  not  succeed  in  maintaining)  -  as  the  first 
University  in  modem  Israel  -  it  was  followed  less  than  a  year  later  by  the  Hebrew  University 
of  Jerusalem.  So,  the  date  of  your  first  Congress  is  also  a  cornerstone  in  the  history  of  Modem 
Israeli  Culture. 

Today,  we  have  19  Faculties  and  Departments,  and  a  number  of  Research  Institutes,  in 
most  fields  of  Engineering  and  Sciences,  as  well  as  Architecture  and  Medicine.  There  are  now 
over  10,000  students  at  the  Technion,  among  them  nearly  3,000  postgraduate  students 
researching  and  studying  for  higher  degrees.  We  have  about  700  full-time  Faculty  members 
and  many  hundreds  of  associated  scientists  and  adjunct  teachers.  I  hope  you  will  have  the 
chance  to  visit  and  meet  some,  though  this  is  the  week  of  our  annual  vacation. 

As  you  know,  Israel  is  a  country  of  small  size  and  limited  natural  resources.  As  such,  we 
put  special  emphasis  on  Science  and  Technology,  the  prime  component  in  our  strive  to 
economic  independence  and  the  building  of  an  advanced  society.  In  this  context,  we 
considerably  value  international  cooperation  of  which  scientific  congresses  are  an  important 
form. 

The  recent  political  changes  in  the  world  have  opened  new  avenues  for  international 
cooperation  and  for  us  in  Israel  they  have  significant  implications.  A  sizable  part  of  the 
scientific  world  was  practically  unreachable  for  us;  this  has  changed  completely  during  the  last 
couple  of  years,  -  as  the  composition  of  this  assembly,  I  believe,  shows.  Moreover,  the 
energetic  pace  of  European  unification  and  the  related  developing  of  special  technological  and 
scientific  collaboration  schemes  of  international  impact  is  also  of  great  relevance  for  us.  As  a 
result,  we  are  undergoing  here  a  process  of  orientation  towards  a  multitude  of  new 
opportunities,  and  I  am  confident  that  the  outcome  will  benefit  all  those  involved. 

Maybe  I  should  also  mention  that  on  historic  scales  (I  already  mentioned  history  before)  the 
direction  of  international  exchanges  has  gone  through  cycles.Today,  there  is  no  doubt  that  the 
leading  Mecca's  for  Science  and  Technology  are  mostly  in  Europe  and  the  United  States  of 
America.  In  his  Nobel  acceptance  speech  in  1979,  the  Pakistani  physicist,  Abdus  Salam.  from 
Imperial  College,  London,  described  through  individual  examples  the  clear  direction  of 
technology  transfer  in  the  early  centuries  of  this  millennium.  He  mentioned  the  cases  of 
Michael  the  Scott  and  the  Danish  physician  Henrik  Harpestraeng  who  travelled  in  the  1 3th 
Century  from  the  underdeveloped  countries  of  the  North  to  the  flourishing  Universities  of  the 
South,  Toledo,  Cordoba.  In  these  places  the  Arabic  and  Hebrew  Scholarship  was  then  at  its 
peak,  and  scholars  were  attracted  to  these  and  other  Southern  Universities  from  both  the 
developed  countries  of  the  Middle  East  and  Middle  Asia,  as  well  as  from  the  developing 
northern  countries. 


It  is  my  wish  that  we  here  do  the  utmost,  so  that  again  we  may  confidently  say  That 
Scholarship  will  emerge  from  Zion".  Meanwhile,  on  the  way  to  this  goal.  1  wish  that  your 
Congress  will  contribute  significantly  to  the  increase  of  knowledge  and  will  stimulate  and 
inspire  you  in  your  future  work.  I  hope  you  have  an  interesting,  enjoyable,  fruitful  and 
pleasant  week  at  the  Technion  in  Haifa." 

Professor  Zvi  Hashin,  President  of  the  Israel  Society  for  Theoretical  and  Applied 
Mechanics,  then  brought  his  greetings: 

"Members  of  the  Presidium,  Ladies  and  Gentlemen, 

l  have  the  honor  and  privilege  to  bring  you  the  greetings  of  the  Israel  Society  for 
Theoretical  and  Applied  Mechanics.  Our  society  is  one  of  the  many  national  societies  which  are 
federated  into  the  International  Union  of  Theoretical  and  Applied  Mechanics  (fUTAM).  The 
Israel  Society  was  founded  in  1950,  only  two  years  after  the  establishment  of  the  State  of 
Israel  in  1948,  by  my  dear  and  esteemed  former  teacher  and  advisor  Prof.  Markus  Reiner.  He 
also  was  the  first  President  of  the  Society,  until  he  passed  away  in  1976.  It  has  been  my  great 
privilege  to  replace  him  as  President  of  the  Society  from  1976  until  the  present.  Professor 
Bodner  and  myself  are  the  delegates  of  Israel  to  the  1UTAM  General  Assembly.  Professor 
Singer  was  our  member  in  the  1UTAM  Congress  Committee  until  1988.  Since  1976  we  have 
consistently  extended  invitations  to  IUTAM  to  hold  the  International  Congress  of  Theoretical 
and  Applied  Mechanics  (ICTAM),  in  Israel  and  we  are  gratified  and  honored  to  have  been 
selected  during  the  last  ICTAM  in  Grenoble  1988  to  host  this  major  event  in  1992.  We  would 
like  to  believe  that  it  is  a  recognition  of  the  growth  and  maturity  of  the  mechanical  sciences  in 
our  country. 

The  Israel  Society  for  Theoretical  and  Applied  Mechanics  is  grateful  to  Professor  Josef 
Singer  for  having  accepted  its  invitation  to  chair  the  Local  Organizing  Committee,  and  to  all  the 
members  of  the  Committee.  In  particular,  we  wish  to  express  special  gratitude  to  Professor  Sol 
Bodner,  Co-Chairman,  and  to  Professor  Alexander  Solan,  Secretary,  for  the  enormous 
amounts  of  time  and  effort  they  have  generously  contributed  to  create  ICTAM  1 8. 

Thank  you  very  much  and  I  wish  all  of  us  a  very  successful  and  pleasant  Congress.” 

Professor  Sol  R.  Bodner,  Co-Chairman  of  the  Local  Organizing  Committee,  then 
presented  his  greetings: 

"I  would  like  to  make  a  few  remarks  on  this  occasion. 

Exactly  forty  years  ago,  as  a  young  graduate  student  working  with  Professor  Nicholas 
Hoff  at  the  Brooklyn  Polytechnic  Institute,  I  attended  the  8th  International  Congress  of 
Mechanics  in  Istanbul.  It  was  a  wonderful  affair  -  the  scientific  level  of  the  presentations,  the 
extraordinary  hospitality  of  the  hosts,  and  the  marvellous  sights  of  the  city.  FYesent  there  were 
the  famous  personalities  of  Mechanics  -  Prager,  who  as  partial  host  gave  a  brief  talk  in 
Turkish,  von  Mises,  Courant,  - 1  believe  von  Karman  and  G.I.  Taylor  were  there,  and  many 
others.  My  impression  from  that  Congress  was  that  Mechanics  was  an  excellent  field  in  which 
to  make  a  career. 

Now,  forty  years  later,  the  Congress  has  returned  to  the  Eastern  Mediterranean  region  and 
we  are  again  gathered  at  its  shore.  I  still  consider  Mechanics  as  a  field  that  offers  continuing 
possibilities  for  creative  and  intellectually  demanding  and  satisfying  work.  It  is  impossible  to 
recreate  the  atmosphere  of  Istanbul  of  1952.  but  we  shall  do  our  best  to  offer  the  hospitality  of 
the  region. 

The  logo  of  the  Congress,  the  ship,  is  taken  from  the  emblem  of  the  port  city  of  Haifa,  the 
fancy  sail  of  the  ship  is  the  Hebrew  word  Chai  which  means  18  and  also  "life",  or  more 
literally  "alive”. 
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I  would  like  to  add  my  personal  welcome  to  all  of  you  who  came  to  Haifa  for  this 
Congress." 

Finally,  Professor  Paul  Germain,  President  of  IUTAM  addressed  the  gathering: 

"It  is  my  privilege,  as  President  of  IUTAM,  to  close  this  official  opening  ceremony  and  to 
declare  open  our  XV///  th  International  Congress  of  Theoretical  and  Applied  Mechanics,  but 
before  I  do  it,  I  intend  to  say  a  few  words,  to  thank  our  hosts,  to  recall  the  objectives  and  the 
responsibility  of  IUTAM  and  to  emphasize  the  role  and  the  importance  of  such  a  Congress  for 
the  modem  development  of  Mechanics. 

We  are  invited  today  by  a  country  and  a  people  who.  at  the  same  time  are  very  old  and  very 
new.  Thank  you.  Mayor  Gurel  for  your  presence  this  morning  and  for  your  welcome.  You 
represent  here  both  of  them.  A  country  which  has  to  face  many  difficult  problems,  which  must 
fight  every  day  to  solve  them,  which  hopes  to  get  and  which  has  to  get,  full  recognition  by  all 
its  neighbours  and  then,  which  has  to  reach  and  to  win  the  peace,  a  true  peace,  a  peaceful 
peace.  A  land  to  which  so  many  in  the  world  are  so  deeply  related  because  they  feel  that,  here, 
lie  some  of  their  deepest  roots.  A  people  who,  more  than  3000  years  ago,  has  begun  to  give 
the  world  an  incomparable  source  of  moral  and  spiritual  life  in  which  so  many  in  the  world 
today  continue  to  draw  inspiration,  stimulation  and  strength.  A  people  who  was  dispersed 
during  centuries,  who  has  too  often  suffered  unjusdy  and  especially  during  the  first  half  of  the 
century  with  this  incomprehensible  and  inadmissible  Shoah.  A  people  who  is  today  engaged 
with  great  success  in  modem  life  and  in  particular  the  scientific  life.  One  of  the  evidences  is 
this  Technion,  this  very  famous  Institute  of  Technology.  Our  best  thanks  go  to  its  President 
who  has  accepted  that  our  Congress  be  held  here.  Many  years  ago,  as  it  was  recalled,  our 
colleagues  from  the  Israel  Society  of  Mechanics  asked  our  Congress  Committee  to  hold  one  of 
our  international  congresses  in  their  country.  Despite  the  high  standard  of  their  achievements  in 
the  field  of  Mechanics,  it  was  not  found  possible  to  accept  their  invitation  on  account  of  the 
international  situation  until  four  years  ago  at  our  last  meeting  in  Grenoble.  Many  unpredictable 
events  happened  in  the  world  since  that  time.  But  one  must  recognize  today  that  the  choice 
which  looked  a  little  risky  was  a  reasonable  one.  No  special  difficulty  arose  for  the  attendance 
from  the  choice  of  the  location.  With  the  help  of  Technion  and  of  the  civilian  authorities,  the 
Local  Organizing  Committee  under  the  friendly  supervision  of  our  Congress  Committee  was 
able  to  do  the  necessary  work  to  hold  this  Congress  in  very  good  conditions.  Without  waiting 
for  the  closing  session  which  is  the  most  appropriate  time  to  express  our  recognition,  we  may 
already  express  to  Professor  Singer  and  to  his  colleagues  our  gratitude. 

Dear  colleagues  and  friends,  at  the  present  time,  the  interactions  of  science  and  technology 
with  the  cultural,  social  economical,  ecological  problems  are  very  strong.  As  scientists  we  are, 
each  one  of  us,  concerned  by  such  a  responsibility.  Our  Union  provides  a  good  tool  to  assume 
this  responsibility  at  the  international  level,  by  its  own  initiatives  first  and  second  through 
ICSU,  the  International  Council  of  Scientific  Unions.  As  you  know  IUTAM  is  one  of  them. 
ICSU  with  its  various  specialized  Committees  has  a  role  which  is  every  day  more  important,  in 
particular,  as  a  partner  on  account  of  its  scientific  expertise  of  the  United  Nations  and  of  its 
numerous  programs  and  organizations.  The  fears  and  the  expectations  of  the  people  cannot  be 
ignored.  The  understanding  and  the  appreciation  of  science  have  to  be  improved.  I  am  glad  to 
tell  you  that  IUTAM  has  significantly  increased  its  involvement  in  the  ICSU  work,  by 
encouraging  the  operations  in  which  the  scientific  content  plays  the  principal  role  and  by 
participating  in  those  when  its  own  expertise  is  very  high.  The  best  example  is  the  scientific 
program  of  the  International  Decade  for  Natural  Disaster  Reduction  run  by  a  group  of  experts, 
coming  from  various  Unions,  chaired  by  Sir  James  Lighthill,  our  former  President,  with  a 
remarkable  success.  The  influence  of  IUTAM  and  of  its  voice  inside  ICSU  is  a  function  of  the 
support  it  received  from  the  mechanicians  all  over  the  world.  And  one  measure  of  this  support 
is  the  presence  and  the  participation  of  many  of  them  at  the  International  Congress.  It  is  one 
first  reason  to  express  to  all  of  you  my  thanks  on  behalf  of  the  Bureau  and  of  the  General 
Assembly  . 
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But  our  main  task,  our  unique  task  this  week,  concerns  mechanics  and  its  development; 
mechanics  an  old  science,  but  a-  always  young  one.  Those  who  like  me  attended  many  such 
International  Congresses  can  tell  you  how  fruitful  these  meetings  are.  They  give  to  each 
participant  as  in  any  other  scientific  meeting  the  opportunity  to  communicate  directly  with  the 
colleagues  working  in  the  same  area,  by  letung  them  know  their  new  results  and  by  learning 
from  them  the  latest  progress.  But  you  find  more  in  such  a  Congress:  the  possibility  to  devote 
one  week  to  your  own  on-going  education,  to  broaden  your  interest  to  the  neighbouring 
disciplines,  to  become  aware  to  the  new  discoveries,  to  see  the  evolution  of  ideas,  the  creation 
of  new  concepts,  the  fantastic  improvements  in  theory  and  applications.  Our  two  General 
Lectures,  our  invited  Sectional  Lectures,  our  three  mini-symposia  with  their  lectures  of 
pedagogical  character  meet  precisely  these  purposes.  And  one  discovers  that  the  contributions 
selected  by  the  International  Papers  Committee  represent  stones  which  are  necessary  to  build 
this  wonderful  monument  of  mechanics,  not  yet  achieved,  always  more  beautiful,  more 
impressive,  more  useful,  300  years  after  Newton.  Yes,  may  all  of  us,  and  especially  the 
younger  ones,  enjoy  their  participation;  may  they  draw  the  greatest  possible  benefit  of  this 
week.  For  those  who  have  prepared  this  meeting,  that  is  their  main  wish;  that  would  represent 
the  best  reward. 

And  now,  it  is  time  to  start.  I  declare  open  the  XV///  th  International  Congress  of 
Theoretical  and  Applied  Mechanics  and  leave  the  chair  to  Sir  James  Lighthill  who,  within  a 
few  minutes,  after  a  few  announcements  by  Professor  Solan,  will  introduce  the  speaker  invited 
to  deliver  the  opening  General  Lecturer.  Thank  you  for  your  kind  attention." 

Sir  James  Lighthill  then  introduced  Professor  Anatol  Roshko  who  presented  the  Opening 
Lecture  "Instability  and  Turbulence  in  Shear  Rows". 

The  scientific  program  of  the  Congress  was  presented  from  Sunday  afternoon  rill  Thursday 
evening  in  5-6  parallel  sessions.  The  poster  sessions  took  place  in  8-9  adjoining  rooms  on 
three  mornings,  at  rimes  when  no  other  lecture  sessions  were  in  progress.  This  resulted  in 
good  attendance  and  very  lively  discussions. 

The  detailed  statistics  of  the  Congress  are  presented  on  the  following  page.  It  is  noted  that 
the  proportion  of  acceptances  to  submissions  was  almost  exactly  1/2.  The  proportion  of  actual 
presentations  to  acceptances  was  lower  than  anticipated  which  seems  to  be  primarily  due  to 
current  difficulties  in  a  number  of  countries  for  obtaining  travel  funds.  As  a  consequence,  the 
overall  attendance  was  also  somewhat  lower  than  expected  but  was  more  than  adequate  for  a 
very  invigorating  Congress. 

The  meaning  of  the  symbols  used  in  the  following  listing  by  countries  is  as  follows: 


S  -  Submitted  abstracts 
AL  -  Accepted  as  Lecture 

AP  -  Accepted  as  Poster 

L  -  Presented  as  Lecture 

P  -  Presented  as  Poster 

EL  -  Invited  Lecture 

*  -  The  numbers  for  S,  AL,  AP  for  Azerbaijan  and  Ukraine  and  other  countries  of  the 

former  Soviet  Union  are  included  in  Russia. 
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The  Social  Program  included  a  Welcome  Reception  on  Saturday  evening,  a  Dinner 
Reception  on  Monday  evening,  both  at  the  Churchill  Auditorium  Plaza;  the  Congress 
Excursion  on  Tuesday  afternoon  to  Rosh  Hanikxa  and  the  Crusader  City,  Acco;  a  city  tour  of 
Haifa  and  the  Druze  villages  on  Mount  Carmel  on  Thursday,  and  other  optional  tours  for 
accompanying  persons  on  Sunday,  Monday  and  Wednesday  as  well  as  pre-congress  and  post¬ 
congress  tours  to  other  parts  of  Israel.  The  Congress  Banquet  was  held  at  the  Dan  Carmel 
Hotel  on  Thursday  evening. 

The  Closing  Session  of  the  Congress  was  held  in  the  Churchill  Auditorium  at  5.30  p.m.  on 
Thursday,  27  August  1992,  immediately  after  the  Closing  Lecture  by  Professor  G.l. 
Barenblatt  on  "Micromechanics  of  Fracture”.  The  closing  ceremony  was  opened  by  Professor 
Josef  Singer,  Chairman  of  the  Local  Organizing  Committee,  with  the  following  words: 

"Ladies  and  Gentlemen, 

All  good  things  must  come  to  an  end,  and  the  18th  ICTAM  is  no  exception.  I  will  not 
repeat  in  detail  my  thanks  to  the  various  IUTAM  committees  and  to  the  different  national 
committees  for  their  excellent  work  and  continuous  assistance,  nor  to  the  Israeli  sponsoring 
organizations  and  others  for  their  generous  support 

But  I  would  like  to  express,  on  behalf  of  the  Executive  Committee  of  the  Local  Organizing 
Committee  (Professors  Bodner,  Hashin,  Solan  and  myself),  our  appreciation  to  our  local 
collaborators: 

To  Professor  Alexander  Yarin  and  Dr.  Dan  Givoli  for  their  very  significant  contributions 
and  also  to  the  other  members  of  the  LOC  for  their  support. 

To  Dalia  Sarid,  the  administrator  of  the  Faculty  of  Mechanical  Engineering,  and  to  Bernice 
Hirsch  and  Edna  Gal,  who  assisted  us  so  ably  in  solving  the  many  detailed  problems  and  last 
minute  rush  jobs,  as  well  as  to  Mottie  Fein  who  did  much  of  the  computer  programing.  Also  to 
the  Superintendents  of  the  Technion  buildings,  of  the  Churchill  Auditorium  and  especially  of 
the  Lady  Davis  Complex,  to  the  Technion  electricians  and  to  the  staff  of  the  Technion 
dormitories.  To  all  of  them,  for  their  help  to  make  Technion  facilities  available  to  us  during  the 
annual  vacation  week,  when  Technion  is  usually  closed. 

Our  thanks  also  to  the  efficient  Kenes  staff  for  their  excellent  professional  operation  during 
the  Congress  that  provided  us  with  the  daily  technical  support  for  our  deliberations  and  other 
activities. 

Ladies  and  Gentlemen, 

We  have  had  a  very  successful  Congress,  and  this  is  primarily  due  to  you,  the  lecturers, 
contributors,  chairmen  and  participants. 

As  you  know,  clouds  of  uncertainties  hung  over  our  preparations  caused  by  the  difficult 
economic  situation  of  the  countries  in  Eastern  Europe.  The  IUTAM  Bureau  and  Executive 
Committee  of  Congress  Committee  made  available  special  funds  for  partial  support  to 
participants  and  this  helped  to  disperse  some  of  those  clouds.  The  attendance  from  these 
countries  was  significantly  augmented,  and  we  have  nearly  the  same  number  of  participants 
from  Eastern  Europe  as  attended  in  Grenoble  and  even  more  from  the  countries  of  the  former 
Soviet  Union. 

Friends, 

We  were  honored  and  happy  to  be  host  to  the  18th  Congress.  We  hope  that  you  not  only 
enjoyed  the  Congress,  but  that  you  will  take  with  you  fond  memories  of  Israel  and  its  people. 
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Here  are  a  few  more  words  for  your  Hebrew  vocabulary:  toda  raba ,  which  means  thank  you 
very  much,  shalom,  which  you  know,  and  lehitraot ,  which  means  au  revoir.” 

Professor  Sol  R.  Bodner,  Co-Chairman  of  the  Local  Organizing  Committee,  then 
remarked: 

"As  one  of  the  group  that  worked  very  hard  the  past  few  months  arranging  the  details  of 
the  Congress,  I  am  very  happy  that  the  Closing  Ceremony  has  come  about.  1  have  the 
impression  that  the  Congress  events  proceeded  fairly  smoothly  and  properly.  From  what  I  saw 
in  the  various  halls,  the  lecture  and  poster  sessions  were  well  attended  with  some  very 
intensive  discussions. 

For  most  of  you,  the  participants,  this  Congress  was  probably  the  occasion  for  your  first 
visit  to  Israel.  To  all  of  you,  we  wish  to  invite  you  to  come  again  whenever  you  can  to  visit 
the  various  universities  and  the  many  sights  of  interest.  We  greatly  value  and  appreciate  the 
continuous  cooperation  of  the  international  Mechanics  community  and  have  been  honored  to  be 
your  hosts  on  this  occasion.  The  word  in  Hebrew  is  lehitraot  -  meaning,  see  you  again.” 

Then  Professor  H.  Keith  Moffatt,  Secretary  of  the  Congress  Committee,  presented  his 
report: 

"Mr  President,  Dear  Colleagues, 

I  think  you  will  agree  that  there  is  something  quite  unique  about  the  International  Congress 
of  Theoretical  and  Applied  Mechanics,  or  ICTAM  as  it  is  affectionately  known.  It  is  not  just 
another  Conference  or  Symposium  to  add  to  the  many  that  are  held  annually;  either  nationally 
or  internationally.  Its  uniqueness  lies  in  the  great  historical  tradition  that  goes  back  to  the  first 
Congress  of  Delft  in  1924,  and  that  has  been  maintained  and  strengthened  through  the 
succeeding  decades.  On  your  behalf,  the  Congress  Committee  strives  to  maintain  the  standard 
of  excellence  established  in  earlier  years,  and  I  believe  that  this  XVIIIth  Congress  in  Haifa  has 
been  quite  exceptional  in  the  quality  and  diversity  of  the  lectures  delivered  and  the  posters 
displayed  and  defended.  Our  Opening  and  Closing  Lecturers  and  our  invited  Sectional 
Lecturers  have  given  us  a  brilliant  survey  of  work  in  the  fields  of  fluid  and  solid  mechanics, 
and  I  wish  to  thank  them  collectively  for  the  great  effort  they  have  devoted  in  preparing  their 
lectures  and  delivering  them  here  in  Haifa. 

The  three  mini-symposia  have  been  equally  successful  and  the  introductory  lectures  have 
been  extraordinarily  stimulating  and  instructive.  1  wish  to  thank  the  Chairmen  and  Co- 
Chairmen  of  these  mini-symposia  for  their  efforts  in  constructing  these  programs,  and  the 
Lecturers  for  their  willing  cooperation. 

I  wish  to  say  a  particular  word  about  the  Poster-Discussion  Sessions.  The  Congress 
Committee  has  always  maintained  that  papers  accepted  for  these  Sessions  must  be  at  least  equal 
in  quality  to  the  standard  that  we  set  for  contributed  lectures.  The  high  standard  of  Posters  at 
this  Congress  and  the  lively  discussions  that  they  have  provoked  testify  to  the  success  of  this 
policy.  The  poster-discussion  sessions  promote  the  development  of  informal  contacts  that  can 
lead  to  future  research  collaboration,  and  I  believe  that  they  form  an  important,  integral  and 
indispensable  part  of  our  Congress  structure.  Participants  enjoy  poster-discussion  sessions 
precisely  because  of  their  participatory  character,  and  time  is  available  for  discussion  of 
controversial  issues,  in  a  way  that  is  generally  not  possible  within  the  strict  time  constraints  of 
conventional  lecture  sessions. 

The  number  of  participants  at  this  Congress  has  been  low,  although  what  we  have  lacked 
in  quantity  we  have  made  up  in  quality  .  Nevertheless,  the  decline  in  numbers  is  a  matter  of 
great  concern  to  the  Congress  Committee,  and  we  believe  that  this  trend  can  be  reversed  only 
by  a  vigorous  campaign  in  all  the  40  countries  of  the  Union  to  raise  awareness  of  the 
importance  of  the  Congress,  and  to  raise  funds  at  National  level  to  support  increased 
participation.  I  invite  you  ail  to  advertise  the  Congress  to  your  colleagues  and  your  graduate 
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students,  so  that  they  may  already  set  their  sights  on  participation  in  the  XIXth  Congress  in 
August  1996,  in  a  location  that  will  soon  be  announced  by  the  President. 

The  Congress  Committee  is  eager  to  be  responsive  to  any  constructive  criticisms  or 
suggestions  that  participants  may  wish  to  make,  which  may  influence  the  planning  of  future 
Congresses.  If  you  have  any  suggestions,  please  write  to  me  and  I  shall  make  sure  that  the 
suggestions  receive  careful  consideration. 

Now  I  have  to  report  the  impending  retirement  of  nine  members  of  the  Congress 
Committee  and  the  election  of  nine  new  members.  The  retiring  members  are: 

Professors  Bevilacqua,  Collins,  Drucker,  Hult,  Imai,  Ishiinsky,  Lighthill,  Muller  and 

Niordson. 

I  wish  to  thank  all  for  their  cooperation  in  the  work  of  the  Committee,  and  particularly 
Professor  Dan  Drucker,  Sir  James  Lighthill  and  Professor  Frithiof  Niordson  who,  as  past 
Presidents  of  IUTAM.  have  served  also  with  great  distinction  as  Presidents  of  the  Congress 
Committee. 

The  new  members,  elected  by  the  General  Assembly  are,  in  alphabetical  order: 

Professors  Aref  (USA),  Bodner  (Israel),  Engelbrecht  (Estonia),  Hutchinson  (USA), 

Lundberg  (Sweden),  Gen  Meier  (Germany),  Pediey  (UK),  Sayir  (Switzerland)  and 

Tatsumi  (Japan). 

I  wish  to  welcome  these  new  members  whose  election  to  the  Committee  takes  effect  on  1 
November  1992 

The  new  Executive  Committee  was  appointed  by  the  Congress  Committee  this  morning  and  is 
as  follows: 

Professor  van  Wijngaarden  -  President 

Professor  Olhoff  -  Secretary 

Professor  Acrivos,  Professor  Kaliszky,  Professor  Moffatt,  Professor  Bodner. 

The  Congress  Committee  is  honoured  to  have  two  members  who  have  served  for  many 
years  and  who  now  serve  without  limit  of  tenure.  These  are  Professor  Nicholas  Hoff,  who 
was  President  of  the  Stanford  Congress  in  1968  and  who  we  have  been  very  happy  to  see  so 
well  and  active  here  in  Haifa  throughout  the  week  of  the  Congress;  and  Professor  Y.H.  Ku, 
formerly  of  the  National  Chengchi  University,  Nanking,  China,  now  Emeritus  Professor  at  the 
University  of  Pennsylvania,  USA.  Professor  Ku  celebrated  his  90th  birthday  last  year,  and  his 
collected  scientific  papers  of  the  last  twenty  years  have  just  been  published  by  the  Shanghai 
Jiaotong  University  Press.  I  am  sure  you  would  wish  to  join  me  in  congratulating  him,  and 
wishing  him  well  for  the  future. 

Finally,  President,  I  wish  to  thank  Josef  Singer  and  the  Local  Organising  Committee  and 
particularly  its  Secretary,  Alex  Solan,  for  their  devoted  efforts  in  making  this  Congress  run  so 
smoothly;  it  is  a  Congress  from  which  we  have  all  gained  intellectual  stimulus  and 
enlightenment,  and  on  which  we  will  all  look  back  with  great  warmth  and  affection." 

The  President  of  IUTAM,  Professor  Paul  Germain  then  said: 

"Dear  Colleagues  and  friends, 

You  have  heard  the  report  of  the  Secretary  of  the  Congress  Committee,  Professor  Moffatt. 
I  strongly  approve  his  statements:  from  the  scientific  point  of  view,  the  XVIIIth  1CTAM  was  a 
very  good  Congress.  I  am  sure  that  all  of  you  agree  with  me  when  I  express  our  best  thanks  to 
the  Congress  Committee  and  in  particular  to  Keith  Moffatt,  to  the  International  Papers 
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Committee  which  had  to  select  the  papers  to  be  presented,  a  very  difficult  task  and  a  great 
responsibility  indeed,  to  the  Local  Organizing  Committee,  in  particular  Professor  Solan  and 
Professor  Bodner,  who  had  the  duty  to  organize  the  pasgramme  taking  into  account  all  the 
constraints  and  requirements,  and,  last  but  not  least,  to  the  contributors  who  have  delivered 
their  best  results  and  made  us  discover  new  problems,  new  methods  and  new  points  of  view. 

The  facilities,  very  well  managed  by  the  Local  Organizing  Committee,  were  also  of  high 
standard.  The  outside  temperature  was  pretty  high.  But  the  air-conditioning  was  running  so 
well  that  the  poor  Europeans  who  are  not  used  to  such  an  equipment  were  somewhat  freezing 
and  they  found  very  nice  to  go  outside  from  time  to  time  in  older  to  get  some  warm  air.  Only 
one  disappointment:  we  were  expecting  a  more  numerous  attendance. 

I  have  now  the  very  pleasant  duty  to  conclude  the  present  Congress  by  announcing  who 
receives  the  two  IUTAM  Bureau  prizes.  I  remind  you  that  such  a  prize  is  offered  to  a  young 
scientist  for  an  outstanding  presentation  of  a  good  paper.  Two  prizes  are  distributed,  one  in 
fluid  mechanics  and  one  in  solid  mechanics.  The  fluid  mechanics  prize  is  given  to  Dr.  Chen 
Xie  Nung  for  an  outstanding  poster  presentation  of  a  paper,  co-authored  by  Dr.  Sharma, 
entitled  "Slender  Ship  Moving  at  Near-Critical  Speed  in  a  Shallow  Channel"  .  It  is  shown  by  a 
theoretical  and  a  numerical  analysis  how  the  steady  motion  of  the  ship  produces  the  unsteady 
generation  of  a  train  of  solitary  waves  in  front  of  it.  The  Solid  Mechanics  prize  is  given  to  Mr. 
S.D.  Guest,  a  graduate  student  of  the  University  of  Cambridge  for  the  very  clear  presentation 
of  the  paper  "Propagation  of  Destabilizing  Waves  in  the  Folding  of  Faceted  Tubes",  co¬ 
authored  by  Dr.  S.  Pellegrino.  The  lecture  was  of  high  quality.  The  non  linear  complex 
problem  dealing  with  wave  propagation  and  the  motion  of  bifurcations  was  clearly  outlined  and 
excellently  illustrated  by  two  very  well  chosen  models. 

We  have  now  to  look  at  the  future.  I  have  the  pleasure  to  announce  that  the  Congress 
Committee  after  many  discussions  held  during  two  meetings  has  decided  that  the  XIXth 
ICTAM  will  take  place  in  Kyoto.  The  Local  Organizing  Committee  will  be  chaired  by 
Professor  Tatsumi.  We  are  all  confident  that  this  first  ICTAM  in  Asia  will  be  the  occasion  of  a 
deep  renewal  of  IUTAM.  We  expect  a  numerous  attendance  from  Far  East  Asia.  Professor 
Moffatt  did  not  want  to  continue  to  be  Secretary  General  after  8  years  of  very  hard  and  very 
efficient  work  devoted  to  ICTAM.  Our  thanks  again  to  Keith  Moffatt.  Professor  Olhoff  will  be 
his  successor.  He  is  well  prepared  to  the  job  with  the  organization  of  the  XVIth  ICTAM  in 
1984  and  his  experience  of  the  International  Papers  Committee  in  1988  and  1992. 

The  General  Assembly  has  chosen  19  symposia  which  will  be  held  in  1994  and  1995,  and 
has  elected  the  new  Bureau.  Professor  Leen  van  Wijngaarden  will  be  our  next  President, 
Bruno  Boley  the  Treasurer,  Franz  Ziegler  the  General  Secretary.  The  other  members  will  be 
Professors  Chemyi,  Moffatt,  Schiehlen  and  Tatsumi.  I  want  to  express  my  deep  gratitude  after 
these  4  years  during  which  I  had  the  great  honour  to  be  the  President  of  IUTAM.  Needless  to 
say  that  I  will  always  keep  the  best  recollection  of  these  4  years  I  have  spent  with  colleagues 
and  friends,  all  devoted  like  me  to  the  promotion  and  to  the  development  of  Mechanics,  all  over 
the  world. 

Thank  you  all  of  you  for  your  dynamic  participation.  Good  luck  to  everybody.  I  hope  that 
all  of  us  will  be  in  Kyoto  in  1966.  '' 

Professor  Germain  then  closed  th**  ?Rth  International  Congress  of  Theoretical  and  Applied 
Mechanics. 
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Abstract 

Increasing  attention  is  being  paid  to  the  large  scale  structure  of  turbulence  and  to  the  so- 
called  “coherent”  vortical  structures  which  have  been  disclosed  and  studied  for  a  number  of 
turbulent  shear  flows  in  laboratory  experiments  and  in  numerical  simulations.  The  coherent 
structures  develop  from  the  instability  waves  which  create  the  flow;  they  portray  the  genesis  of 
the  turbulence  in  the  primary  instability  of  the  global  vorticity  distribution.  This  is  not  in  the 
sense  of  classical  laminar  instability,  which  initiates  transition  to  turbulence,  but  as  the  driving 
instability  in  the  fully  developed  turbulence.  That  instability  provides  the  link  to  the  amplitude 
of  the  turbulent  motion,  which  in  classical  turbulent  modelling  must  be  calibrated  empirically 
as  a  fundamental  step  or  steps  in  the  closure  of  the  Reynolds  averaged  equations  of  motion. 
It  also  rationalizes  the  dependence  on  various  parameters,  such  as  compressibility,  clarifies 
response  to  external  disturbances,  and  suggests  the  possibility  of  “turbulence  control”.  The 
global  instability  is  being  incorporated  into  new  models  of  turbulent  shear  flow  development 
and  the  coherent  structures  form  the  basis  for  new,  Lagrangian  models  of  chemical  mixing  and 
reaction  in  these  flows. 


1.  INTRODUCTION 

Our  knowledge  of  turbulent  flow  is  still  mainly  empirical.  Feynman  has  called  turbulence 
“the  greatest  puzzle  in  classical  physics”.  It  has  been  a  prominent  topic  at  these  Congresses 
of  Theoretical  and  Applied  Mechanics.  That  it  continues  to  attract  the  attention  and  energy  of 
impressively  large  numbers  of  researchers  attests  to  its  technological  importance  and,  just  as 
importantly,  to  its  intellectual  challenge. 

Since  the  recognition  over  a  century  ago  of  its  importance  in  engineering,  the  problem  of 
turbulence  has  developed  in  many  directions  and  presents  different  points  of  view  and  different 
goals  for  various  schools  of  research.  Still  the  ultimate  objective  for  applied  mechanics  is  the 
original  one,  to  model  and  “predict”  turbulent  fluxes  of  momentum  and  of  passive  scalars  such 


as  heat.  These  fluxes  are  at  the  heart  of  practical  technological  problems  such  as  the  need 
to  predict  aerodynamic  drag  or  to  compute  mixing  rates  in  propulsive  devices,  in  chemical 
engineering  processes  and  in  tiie  upper  atmosphere,  to  name  just  a  few  examples.  Equally 
important  as  the  urgent  technological  motivation  has  been  an  intense,  continuing  interest  to 
understand  turbulence  in  a  fundamental,  scientific  way;  and  each  has  benefited  from  the  other. 
The  pursuit  of  these  two  aspirations  has  been  largely  within  the  context  of  two  dominant 
approaches  to  the  turbulence  problem,  namely  Reynolds-equation  modelling  and  the  statistical 
theory  of  turbulence. 

The  equations  obtained  by  Reynolds  in  1 895  may  be  written  in  their  simplest  form  for  flow 
which  is  incompressible  (density  p  =  const)  and  in  which  the  mean  values,  denoted  by  an 
overbar,  (  ),  are  steady.  Derived  in  an  Eulerian  frame,  the  equations  are 
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The  crucial  result  of  “Reynolds  averaging”  is  to  introduce  into  these  mean-flow  equations  the 
term  Tt]  =  -pu'pii,  which  denotes  mean  momentum  flux  resulting  from  correlated  fluctuation 
of  momentum  pu\  and  velocity  u'  at  a  point  in  the  flow.  From  its  juxtaposition  with  the  viscous 
stress  f ij  in  the  momentum  equation,  Tt)  is  called  the  “Reynolds  stress”.  For  a  Newtonian  fluid, 

the  viscous  stress  is  related  to  the  velocity  field  by  a  linear  relation,  ftJ  =  /i( +  ^U] 
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where  /t  is  the  coefficient  of  viscosity.  For  a  turbulent  flow,  to  relate  the  Reynolds  stress  to  the 
velocity  field  is  the  great  problem  of  "closure”  for  the  Reynolds  equations. 

Other  equations  of  this  type  include  one  for  turbulent  transport  of  a  passive  scalar  c: 
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where  qt  =  ~D  and  Qt  =  c'u't.  Still  others  have  been  developed  over  the  years.  For 
example  it  is  possible  to  write  an  equation  for  the  turbulent  kinetic  energy,  k'  =  - 
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The  first  term  on  the  right  hand  side  denotes  the  rate  of  production  of  turbulent  energy  by 
the  Reynolds  stress  while  the  last  one  is  the  rate  of  viscous  dissipation.  The  redistribution 
terms  (RDT)  contain  new  correlations  such  as  p'u\  and  k'u't.  It  is  also  possible  to  write  a 
corresponding  equation  for  the  Reynolds  stress  itself: 
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with  additional  new  correlations,  e.g.  />u'Ku'tulj,  appearing  in  the  RDT.  Thus,  introducing  such 
higher  order  equations  escalates  the  number  of  correlations  which  have  to  be  “modelled"  for 
closure,  but  this  may  still  be  advantageous  because  now  the  fundamental  quantity  TtJ  is  nor 
modelled;  its  evolution  is  compand  from  Eq.  (4)  in  conjunction  with  Eqs.  ( 1 )  and  (2)  (Launder, 
1990).  We  shall  refer  to  the  hierarchy  of  such  equations  as  the  “Reynolds-averaged  equations" 
or,  symbolically,  "Rc  equations”  and,  correspondingly,  to  “Re.  modelling". 

The  statistical  theory  of  turbulence,  introduced  by  Taylor  in  1935  and  developed  by  Batch¬ 
elor  and  others,  is  for  homogeneous  fields,  thus  does  not  address  the  turbulent  fluxes  and  the 
all  important  question  of  the  turbulence  production.  Instead  it  sets  out  to  develop  a  rational 
description  and  theory  of  a  simpler  turbulent  problem,  through  equations  for  two-point  correla¬ 
tion  functions.  For  example,  f(r)  =  u\(x)u\(x  +  r)  j  u\{x)u\{x)  correlates  the  fluctuating 
velocity  component  u[(t)  at  two  points  separated  by  a  distance  r.  This  approach  has  produced 
a  very  useful  language  and  framework  for  developing  ideas  and  concepts  in  turbulence  and  has 
led  to  important  results,  for  example  Kolmogorov’s  theory  of  an  inertial  sub-range  of  turbulence 
dynamics. 

In  Rc  modelling  as  well  as  in  the  statistical  theory,  scaling  methods  have  played  a  very 
important  role.  One  might  even  say  that  they  have  given  us  most  of  our  insights  into  turbulence 
and  have  provided  the  best  vehicle  for  organizing  our  empirical  knowledge  of  it.  At  its  simplest 
level,  scaling  makes  use  of  dimensional  analysis  and  similarity  concepts  to  make  predictions 
about  the  scaling  parameters  in  particular  canonical  flows. 

An  instructive  illustration  of  this  is  the  turbulent  jet,  an  example  of  which  is  shown  in 
Figure  1 .  It  is  a  photo  of  a  rocket  exhaust  plume  on  a  test  stand.  The  thrust  F  of  the  rocket  is 
quoted  as  “3  million  pounds".  With  that  we  can  infer  that  the  jet  Reynolds  number  in  the  far 
field.  Re  =  y/pF/p.,  is  2  x  109.  At  this  value  of  Reynolds  number  the  magnitude  of  the  mean 
viscous  stress  j  ftJ  \  in  Eq.  (2)  is  only  about  10~7  compared  to  the  Reynolds  stress  and  cannot 
directly  affect  the  development  of  the  mean  flow.  Thus  it  is  assumed  that  the  viscosity  //,  drops 
out  of  the  group  of  governing  parameters.  This  assumption  that,  at  high  values  of  Rc,  turbulent 
free-shear  flow  fields  do  not  depend  on  the  viscosity  has  support  from  various  experimental 
evidence.  It  implies  that  TtJ  is  independent  of  Re  and  that,  in  Eq.  4,  the  dissipation  term  is  also 
independent  of  Re;  i.e.  the  velocity  gradients  adjust  themselves,  with  changing  //,  to  balance 
the  dissipation  required  by  the  /^.-independent  production  term.  (In  wall  flows,  the  viscous 
terms  cannot  be  omitted  even  in  Eq.  2,  because  near  smooth  walls  they  become  dominant.)  It 
also  follows  that  sufficiently  far  from  the  origin  of  the  jet,  in  the  “far  field”  where  the  initial 
geometrical  details  should  no  longer  be  important,  the  jet  development  depends  only  on  the 
distance  x  from  its  origin  and  on  global  parameters,  namely  the  thrust  F  and  the  ambient  density 
p.  Dimensional  analysis  then  tells  us  that  the  jet  thickness  h(x)  must  be  proportional  to  x  and 
the  jet  grows  at  a  constant  rate  dhjdx  which  must  be  the  same  for  all  free  jets.  Thus  laboratory 
measurements  at  Re  =  H)4,  where  \fn\  ~  1()~2|  TtJ  | ,  “predict”  that  for  all  higher  Reynolds 


Fig.  1.  Turbulent  jet  from  a  rocket  on  a  test  stand.  (From  Los  Angeles,  Lane 
Magazine  and  Book  Co.,  Menlo  Park,  California. 

numbers  the  turbulent  jet  cone,  made  visible  by  some  marker  as  in  Fig.  1,  will  have  an  angle  of 
about  23°.  An  important  property  of  the  jet  is  the  rate  at  which  it  entrains  surrounding  fluid. 
It  literally  sucks  or  induces  fluid  into  itself,  so  that  its  mass  flow  rate  increases  continually 
along  its  axis:  the  entrained  mass  flow  rate  is  me( x).  Again  from  dimensional  analysis,  this 
entrainment  rate  can  be  related  to  the  thrust  and  the  surrounding  fluid  density  by  the  formula 
dme/dx  =  const. y/pF  .  From  an  experiment  by  Ricou  and  Spalding  (1961)  at  Re  up  to  105, 
the  value  of  const,  is  known  to  be  0.28.  With  this  we  can  “predict”  that  at  3  million  pounds 
of  thrust  the  jet  in  Fig.  1  was  entraining  air  at  the  rate  1 100 Kg/sec/meter.  (This  result  will  be 
modified  in  early  parts  of  the  jet,  where  the  gases  are  still  hot  and  the  density  p  is  not  uniform, 
and  near  the  top  of  the  jet,  where  buoyancy  may  play  a  role.)  Such  entraining  capability  is  the 
basis  for  jet  pumps,  ejectors  and  thrust  augmentors. 


Similar  and  more  sophisticated  applications  of  dimensional  and  similarity  arguments 
(c.f.  Monin  and  Yaglom,  1971;  Tennekes  and  Lumiey,  1972;  Barenblatt,  1979)  have  led  to 
a  large  number  of  useful  results  and  insights  into  various  turbulent  flows.  Indeed,  the  use  of 
“eddy  viscosity”  and  “mixing  length”  formalisms  in  self-similar  turbulent  flows  can  perhaps 
be  better  justified  as  examples  of  scaling  rather  than  “gradient-diffusion”  methods.  A  notable 
result  from  this  approach  is  von  Karman’s  use  of  “mixing  length  theory”  to  derive  the  log  law 
for  turbulent  boundary  layers  and  to  introduce  his  famous  constant  k. 

One  impact  of  Re  averaging  and  of  the  statistical  theory  was  to  tend  to  encourage  a  view 
of  “fully  turbulent”  flow  as  too  complicated  and  disorganized  to  contain  structural  features  that 
could  be  usefully  incorporated  into  any  model.  By  design,  Re  modelling  and  the  classical 
statistical  theories  deal  only  with  mean  quantities;  the  effects  of  turbulent  motion  appear  in 
the  statistical  correlations.  In  the  Re  equations  these  are  correlations  at  a  point,  containing 
no  information  about  spatial  or  temporal  structure  of  the  turbulence.  In  the  statistical  theory, 
spatial  correlation  functions  (and  space-time  correlations)  do  provide  some  information  about 
turbulent  structure.  Such  spatial  correlation  functions  can,  in  principle,  also  be  introduced  into 
the  Re  equations  (Batchelor,  1953;  Lesieur,  1986).  In  recent  work  (e.g.  Lundgren,  1982;  Pullin 
and  Saffman,  1993),  efforts  are  being  made  to  design  mathematical  models  of  physical  vortical 
turbulent  structure  for  special  regimes  such  as  Kolmogorov’s  universal  equilibrium  regime. 
Generally  however,  with  a  few  such  exceptions,  arguments  in  turbulence  modelling  and  theory 
make  much  use  of  imagery  —  “eddies”;  “transfer  of  energy  between  eddies”;  “interfaces”; 
“large  and  small  structure”,  etc.  —  without  the  force  of  true,  physically  appropriate  models. 
There  is  continuing  need  for  such  models,  either  for  helping  guide  classical  modelling  or  for 
discovering  new  approaches. 

2.  ORGANIZED  STRUCTURE  IN  TURBULENCE 

The  preceding  outline  of  the  two  dominant  approaches  to  turbulence  provides  background 
for  discussing  in  the  main  part  of  this  talk  another  trend,  possibly  going  back  to  Leonardo,  in 
which  it  is  recognized  that  patterns  of  organized  structure  exist  in  in  turbuient  shear  flow's. 

After  about  1940  experimental  work  on  turbulent  flow  was  largely  steered  in  the  two  main 
directions  of  /?.e-modelling  and  statistical  theory.  The  hot-wire  anemometer  which  had  been 
perfected  was  able  to  obtain  accurate  time  histories  of  velocity  fluctuations  u'(t)  and  thus  to 
provide  the  various  correlations  needed  to  model  the  Re  equations  as  well  as  the  correlation 
functions  and  spectra  needed  in  the  statistical  theory.  Highly  successful  in  these  tasks,  the  hot 
wire  did  not  at  first  reveal  anything  that  could  be  called  organized  structure.  For  one  thing, 
there  was  little  incentive  to  examine  or  analyze  the  velocity  data  for  anything  but  the  mean 
quantities  and,  in  any  case,  even  if  examined  in  detail  the  fluctuation  history  at  a  single  point 
in  any  turbulent  flow  is  not  revealing,  nor  are  the  single-point  correlations  used  in  the  Re 
equations.  But  two  point  correlations  of  the  fluctuation  histories  at  two  different  points  do  give 
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a  start  on  defining  spatial  structure.  They  were  first  used  by  Townsend  (1956)  and  his  school, 
e.g.  Grant  (1958),  to  find  evidence  of  organized  vortical  structure  in  far  wakes  and  boundary 
layers.  Space-time  correlations,  introduced  by  Favre,  Gaviglio  and  Dumas  (1958),  revealed 
moving  patterns  with  defineable  celerities,  scales  and  lifetimes  in  boundary  layers  and  in  free 
shear  layers  (Fisher  and  Davies,  1964).  All  these  were  the  first  intimations  of  "organized”  or 
“coherent”  structure  to  be  extracted  from  hot  wire  measurements  in  fully  developed  turbulent 
flows. 

Actually,  examples  of  coherent  structure  in  flows  with  turbulence  but  with  a  deterministic 
scale  were  abundant.  Most  prominent  is  the  example  of  vortex  shedding  from  bluff  bodies 
and  the  corresponding  Karman  “vortex  street”.  For  a  circular  cylinder,  vortex  “shedding” 
occurs  from  Re.  =  50  to  the  highest  values  observed  in  the  laboratory  (Re  =  107),  and  to  still 
higher  values  in  (less  well  controlled)  geophysical  flows.  Turbulent  motion,  superimposed  on 
the  organized,  periodic  motion,  appears  already  at  Re  ~  102.  But  the  periodic  creation  and 
shedding  of  vortical  structures  is  dominant,  at  a  well  defined  frequency  fs  that  is  related  to  the 
body  dimension  and  the  velocity  of  the  cylinder,  expressed  non  dimensionally  in  the  Strouhal 
number,  S  =  f$d/Ux.  Therefore  this  was  not  usually  considered  to  be  a  “fully  turbulent  flow”, 
which  by  implication  is  (statistically)  featureless.  The  power  spectrum  of  velocity  fluctuation 
in  the  near  wake  of  a  bluff  body  has  a  sharp  peak  at  the  Strouhal  frequency,  superimposed  on  a 
broad  backgiound,  in  contrast  with  the  featureless  spectra  of  homogeneous  turbulence. 

Far  downstream  of  the  position  of  the  cylinder,  in  the  “far  wake”,  the  sharp  spectral  peak  at 
fs  disappears.  It  was  in  that  region  of  “fully  developed  turbulence”  that  Townsend  and  Grant 
found  evidence  of  defineable  structure.  Taneda  (1959),  using  flow  visualization,  found  that  the 
original  vortex  street  breaks  down  at  some  distance  downstream,  to  be  replaced  by  one  with 
larger  scale  but  similar  in  its  side  view  appearance.  Quoting  him,  “After  this,  however,  the 
wake  shows  a  strong  tendency  to  rearrange  itself  again  into  the  configuration  of  the  Karman 
vortex  street.  In  this  way  the  formation  and  the  breakdown  of  the  regular  vortex  street  occur 
alternately  as  the  distance  from  the  obstacle  is  increased,  the  wave  length  becoming  larger 
and  larger  with  every  transition  from  one  vortex  street  to  the  next.  Even  when  the  regular 
vortex  street  is  not  formed,  the  dimension  of  the  predominant  discrete  vortices  which  appear 
in  the  wake  always  increases  as  the  distance  from  the  body  increases.”  The  Reynolds  numbers 
in  Taneda’s  experiments  were  so  low  (<  300)  that  they  were  probably  not  seen  as  examples 
of  turbulent  flow  but  later  experiments  by  Cimbala  (1984)  at  higher  values  of  Re  (~  5000), 
using  flow  visualization  and  hot-wire  measurements,  showed  similar  structure  appearing  far 
downstream  where  there  was  no  Strouhal  spectral  peak  in  the  spectrum.  Indeed,  the  wake 
shown  in  Fig.  2  was  downstream  of  a  porous  flat  plate  (a  strip  of  screen  placed  normal  to 
the  stream)  which  did  not  shed  vortices  at  all.  At  x/d  —  10  the  wake  contains  only  small 
scale  turbulence  in  fluid  which  has  gone  through  the  screen.  Between  xjd  —  50  and  70  there 
is  a  burst  of  organized  structure,  quite  similar  to  those  observed  by  Taneda.  While  in  this 
edge  view  it  looks  like  a  Karman  “street”,  views  normal  to  the  plane  of  the  wake  show  it  to 
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Fig.  2.  Wake  of  porous  flat  plate  with  solidity  47%.  Re  =  GOOO.  Visualized  by  a 
smoke  wire  placed  at  x/d  =  0  and  x/d  =  44,  resp.  (Cimbala,  1984) 

be  three  dimensional,  suggestive  of  the  mean,  correlation  structure  inferred  by  Townsend  and 
Grant.  At  other  times,  at  the  same  location,  the  wake  may  be  more  disorganized,  or  contain 
a  burst  of  coherent  structure  with  a  different  scale.  Thus,  no  fixed  frequency  shows  up  in 
spectral  measurements  of  velocity  at  a  fixed  position  but,  rather,  a  broad  spectrum  with  a  well 
defined  maximum  peak  which  shifts  continuously,  to  lower  frequency  with  increasing  distance 
downstream,  i.e.  with  increasing  wake  scale. 

These  trends  in  Ttneda’s  and  Cimbala’s  experiments  illustrate  general  features  of  coherent 
structure  in  fully  developed  turbulent  flow  that  had  also  emerged  from  experiments  on  free 
shear  layers  (also  called  “mixing  layers”).  In  particular  it  was  pictures  like  the  one  in  Fig.  3, 


Fig,  3.  Shadowgraphs  of  mixing  layer  with  velocity  ratio  U2/U\  =0.38.  Density 
uniform.  {U\  -  U2)6/o  =  5  x  104,  based  on  scale  of  largest  structures. 
Simultaneous  edge  and  (partial)  plan  views.  (Konrad,  1976). 
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showing  vortical  structures  in  a  matrix  of  small  scale  turbulence,  that  helped  confirm  various 
earlier  indications  (Liepmann,  1 952)  that  organized  structures  exist  in  “fully  developed  turbulent 
flows”  and  encouraged  efforts  to  understand  their  significance  and  possible  uses. 

The  following  is  a  brief  account  of  some  of  the  developments  from  those  efforts,  with 
emphasis  on  the  theme  of  this  paper  that  the  coherent  structures  in  turbulent  shear  flow  result 
from  and  exhibit  the  instability  of  the  underlying,  global  mean  flow.  We  discuss  mainly  jets, 
wakes  and  mixing  layers,  which  are  examples  of free  shear  flows,  contrasted  with  wall-bounded 
flows  such  as  pipe  flows  and  boundary  layers,  in  which  manifestations  of  coherent  structure 
are  also  found  (Kline  et  al.,  1967,  Robinson,  1991).  The  wall  bounded  flows  are  considerably 
more  complicated  than  the  free  shear  flows  and  identification  of  the  coherent  structures  in  them 
is  more  difficult.  Wall  flows  have  a  two-scale  structure  while  free  shear  flows  have  single-scale 
structure,  characterized  by  some  measure  of  the  local  thickness  6{x).  At  sufficiently  high 
Reynolds  number  the  development  of  free  shear  flows  may  be  independent  of  viscosity,  as 
indicated  above.  In  wall  flows,  on  the  other  hand,  the  no-slip  condition  at  the  wall  ensures 
that  viscosity  is  always  “in  the  picture”,  creating  a  two-scale  structure.  The  fundamental 
difference  is  also  reflected  in  the  stabilities  of  the  underlying  mean  flow  profiles.  Free  shear 
flows  are  variously  described  as  having  “inviscid”,  “dynamic”,  “inflectional-point”  or  “fast" 
instability;  wall  bounded  flows  are  “inviscidly  stable”  or,  at  best,  have  “slow”  instability. 
The  no-slip  condition,  the  two-layer  structure  and  the  slow  instability  in  wall  flows  results  in 
coherent  structure  which  is  quite  different  and  more  difficult  to  understand  than  that  in  the 
free-shear-layer  class.  In  this  paper  only  the  latter  is  addressed. 

In  earlier  paragraphs  and  in  the  following,  reference  is  made  to  “fully  turbulent  flow”.  This 
is  a  term  that  has  been  used,  usually  in  connection  with  the  self-similar  canonical  shear  flows 
(jets,  wakes,  mixing  layers,  boundary  layers),  to  characterize  the  flow  state  at  high  Reynolds 
number  and  sufficiently  far  downstream  to  have  “forgotten”  the  details  of  the  initial  conditions. 
It  has  also  tended  to  suggest  the  loss  of  defineable  structural  features.  To  put  this  in  perspective, 
it  is  important  to  make  a  distinction  between  the  “near  field”  and  the  “far  field",  e.g.  the  near 
wake  and  far  wake  of  a  bluff  cylinder.  The  near  field  depends  on  initial  conditions  and  scales 
with  the  initial  parameters,  e.g.  the  vortex  shedding  frequency  is  locked  in  to  the  cylinder 
diameter.  In  contrast,  the  far  field  has  forgotten  the  details  of  the  initial  conditions,  depends 
only  on  global  parameters  such  as  the  drag  force,  and  scales  with  local  parameters,  such  as  the 
local  thickness  6(x),  which  are  evolving.  These  distinctions  between  near  and  far  fields  carry 
over  to  coherent  structures.  In  both  cases  the  structures  have  their  genesis  in  the  local  instability 
of  the  flow,  but  they  are  more  easily  recognizeable  in  the  near  field  because  of  fixed  scales.  In 
the  near  field  the  spectrum  of  coherent  structure  is  sharp  because  the  scales  are  sharply  defined; 
in  the  far  field  the  scales  of  coherent  structures  passing  a  given  point  are  broadly  distributed. 


3.  STRUCTURE  IN  FREE  SHEAR  LAYERS 

A  good  introduction  to  the  relation  of  instability  to  coherent  structure  in  free  shear  layers  is  a 
picture.  Fig.  4,  obtained  by  Freymuth  (1966),  of  the  initial  instability  development  in  a  laminar 
free  shear  layer.  The  initial  part  of  this  picture  displays  the  expected  Kelvin-Helmholtz  (K-H) 
two  dimensional  “laminar”  instability,  whose  beginnings  are  accurately  predicted  by  Rayleigh’s 
linear  stability  equation  for  inviscid  flow.  The  exponential  growth  of  the  velocity  and  voilicity 
perturbations  causes  thickening  of  the  mean  shear  layer,  which  eventually  closes  the  instability 
window  and  cuts  off  the  wave  development,  by  which  time  the  wave  has  steepened  into  a 
vortical  structure;  we  call  it  a  “coherent”  structure.  Its  scale  is  related  to  the  thickness  of  the 
initial  laminar  layer.  The  fluctuating  velocity  has  a  characteristic  frequency,  connected  to  the 
initial  thickness  and  the  initial  velocity.  These  are  near-field  features. 


Fig.  4.  Kelvin  Helmholtz  instability,  vortex  formation  and  pairing  in  a  free  shear 
layer.  (Freymuth,  1966). 

The  right  hand  side  of  the  picture  shows  a  remarkable  phenomenon,  the  “pairing”  of  the 
vortices  which  form  at  the  end  of  the  first  instability.  This  depicts  a  subharmonic  response 
(Browand,  1966;  Ho  and  Huang,  1982)  of  the  thickened  layer  to  available  disturbances;  the 
K-H  instability  in  the  thicker  layer  that  has  emerged  from  the  initial  instability  is  repeated  at 
a  lower  wave  number.  That  is,  the  K-H  instability  results  in  velocity  fluctuations  «'(/.)  and 
v'(t)  which  cause  the  layer  to  grow  through  the  action  of  the  Reynolds  stress,  u'v',  that  is 
developed.  When  the  mean  thickness  has  grown  to  approximately  twice  the  initial  thickness 
the  “window”  of  instability  for  the  initial  wave  number  ends  and  the  layer  becomes  receptive 
to  a  lower  wave  number.  The  second  instability  may  be  viewed  either  as  the  active  interaction 
of  two  vortices  from  the  first  instability,  resulting  in  “pairing”,  or  as  the  “roll  up”  of  (nearly 
passive)  vortices  by  the  subharmonic  wave.  This  process  is  repeated  and  the  growth  of  the 
shear  layer  is  associated  with  repeated  “pairing”  (Winant  and  Browand,  1974)  or  amalgamation 
of  the  coherent  structures  into  larger  ones.  After  the  second  pairing  the  shear  layer  has  largely 
lost  memory  of  the  initial  conditions  and  is  approaching  a  far-field,  self-similar  development, 
with  scales  independent  of  initial  conditions. 
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More  of  the  far  field  is  shown  in  the  picture  in  Fig.  3  obtained  by  Konrad  (1976).  Here 
the  total  length  of  shear  layer  is  50  Ao,  where  Ao  in  the  initial  K-H  wave  length,  and  about 
5  pairings  or  amalgamations  have  occurred,  while  Freymuth’s  picture  spans  9  A0  and  only  1 
pairing  has  occurred.  Most  of  Freymuth’s  picture  is  in  the  near  field,  while  in  Konrad’s  picture 
the  near  field  is  to  the  left  of  the  “mixing  transition”  (Breidenthal,  1980)  marked  “mt”  . 

It  is  striking  that  the  flow  in  the  far  field  contains  coherent  structures  that  are  similar  to 
those  in  the  near  field.  But  there  are  fundamental  differences.  One  of  the  most  important  is 
that,  consistent  with  the  far-field  hypothesis,  there  is  no  fixed ,  external  reference  scale;  the  local 
mean  scale  of  coherent  structures,  i.e.  the  vortex  spacing  or  wave  length  is  related  to  the  local, 
evolving  scale,  i.e.  the  thickness  S(x).  Since  the  mean  thickness  b(x)  increases  smoothly,  the 
implication  is  that  the  spectrum  of  scales  (and  corresponding  frequencies)  at  given  x  cannot 
be  sharp,  since  there  is  nothing  special  about  any  value  of  x.  Indeed,  measurements  of  the 
spacing  A(z)  (Bernal,  1981,  1988)  or  of  frequency  f(x)  (Winant  and  Browand,  1974)  show 
a  broad  distribution  about  a  maximum  value.  The  implication  (and  observation)  is  that,  at  a 
given  position,  there  are  intervals  of  coherent  wave  trains,  each  with  different  instantaneous 
scale,  interspersed  with  intervals  of  disorganized  motion. 


Fig.  5.  Distribution  of  spacings  of  coherent  structures  in  a  turbulent  free  shear  layer. 
A(x)  is  the  mean  spacing  at  x.  Vertical  bars  are  from  measurement.  Curve 
is  a  log  normal  distribution.  (Bernal,  1981,  1988). 


How  do  the  properties  of  these  wave  trains  compare  with  those  of  classical  K-H  wave 
properties?  First,  as  already  mentioned,  the  spectrum  of  wave  lengths  or  spacings  A  of  vortices 
passing  a  given  position  x  are  broadly  distributed  around  a  mean  vlaue  A  (a:)  =  3.5  ^(.r), 
where  8^{x)  is  the  evolving  “vorticity  thickness”.  That  is,  although  the  large  structure  is 
coherent  it  is  not  deterministic.  The  distribution  (Fig.  5)  has  been  measured  by  Bernal  (1981, 
1988),  who  constructed  a  statistical  model,  following  the  lead  of  Takaki  and  Kovasznay  (1978), 
and  found  it  to  have  a  log  normal  distribution,  which  is  characteristic  of  a  fractionization  process 
or  its  opposite,  an  amalgamation  process. 

Second,  it  is  found  that  the  most  probable  wave  length  in  this  distribution  corresponds 
to  the  neutral  point  in  the  window  of  K-H  amplification  rates,  when  this  is  scaled  with  local 
parameters.  Correspondingly,  the  frequency  power  spectrum,  just  as  in  the  measurements  of 
Cimbala  in  the  wake,  has  a  (broad)  peak  at  the  neutral  point.  The  explanation  of  why  these 
maxima  appear  at  the  local  neutral  frequency,  rather  than  at  the  frequency  corresponding  to 
the  peak  in  the  amplification  curve,  is  as  follows  (Ho  and  Huerre,  1984;  Marasli,  Champagne 
and  Wygnanski,  1991).  A  wave  in  a  spatially  growing  layer  develops  at  fixed  frequency.  Its 
amplification  rate  depends  on  the  local  dimensionless  frequency  /  =  ffi[x)/Uc\ the  convective 
wave  speed  Uc  is  nearly  constant  but  the  thickness  h(x)  is  increasing.  Thus  /( x )  is  decreasing 
and  the  wave  eventually  reaches  a  position  x  where  /  is  at  the  neutral  value  and  the  wave 
cannot  grow  further.  It  is  now  at  its  maximum  amplitude  and  has  developed  into  a  “coherent” 
vortical  structure.  The  corresponding  frequency  and  the  wave  length,  or  vortex  spacing,  is 
the  most  probable  one  for  that  position.  That  is,  the  most  probable  vortex  spacing  at  given  x 
corresponds  to  the  frequency  with  the  greatest  opportunity  for  amplification  up  to  that  point. 

By  that  time  the  linear  theory  is  no  longer  accurate.  Gaster,  Kit  and  Wygnanski  (1985) 
made  measurements  of  the  velocity  fluctuations  in  a  mixing  layer  with  local  Re  =  1()4,  using 
small  forcing  to  provide  a  phase  reference.  They  measured  profiles  across  the  shear  layer  of 
the  velocity  fluctuation  amplitude,  in  effect  the  eigenfunction  of  the  instability,  and  found  that 
in  the  initial  stages  of  amplification  it  agrees  well  with  the  theoretical  shape,  showing  a  single 
maximum  and  two  side  lobes.  In  the  linear  theory  these  lobes  grow  and  become  dominant  at 
the  neutral  point;  in  the  experiment  the  tendency  is  the  same  but  quantitative  correspondence 
is  not  as  good  as  in  the  earlier  stages.  However  their  profiles  of  the  phase  distribution  across 
the  layer  corresponds  well  with  the  theoretical  ones  up  to  the  neutral  point. 

Similar  measurements  were  made  by  Marasli  et  al  (1991)  in  the  plane  wake  of  a  flat  plate 
at  x/8q  up  to  1400  and  local  Re  ~  103.  “For  the  unforced  flow,  the  peak  in  the  measured 
spectrum  of  the  cross-stream  (turbulent)  velocity  fluctuations  at  any  downstream  location  (in 
the  far  wake)  corresponds  to  the  local  neutral  frequency  from  linear,  spatial  stability  theory  for 
inviscid,  parallel  flow.”  That  is,  the  trends  were  similar  to  those  in  the  mixing  layer  and  to 
those  observed  by  Cimbala. 

Another  difference  between  the  near  and  far  fields  of  a  free  shear  layer  is  that  the  layer 
emerges  into  the  far  field,  after  the  mixing  transition,  with  three  dimensional  structure.  In 
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particular,  a  secondary,  spanwise  instability  leads  to  the  development  of  streatnwise  vorticity 
which  organizes  itself  into  a  lacework  of  “ribs”  between  the  primary  rollers  (Bernal,  1981; 
Hussain,  1984;  Rogers  and  Moser,  1992).Three  dimensional  rib  structure  has  also  been  observed 
in  wakes  in  laboratory  experiments  and  numerical  simulations  (Meiburg  and  Lasheras,  1990). 

It  may  seem  surprising  that,  in  the  presence  of  three  dimensionality,  the  agreement  with 
theoretical  instability  calculations  for  two  dimensional  layers  and  wakes  could  be  as  good 
as  it  is  in  the  comparisons  described  above  but,  as  noted,  the  agreement  is  best  in  the  early, 
low-amplitude  stages  of  the  instability,  where  linear  theory  still  applies.  There  the  spanwise 
instability  is  still  uncoupled  from  the  primary  instability.  In  the  mixing  layer,  the  spanwise 
instability  is  slower  than  the  primary,  thus  the  latter  dominates  and  results  in  quasi  two- 
dimensional  rollers.  In  the  wake,  the  spanwise  instability  may  develop  as  fast  as  the  primary 
(Robinson  and  Saffman,  1982),  resulting  in  structures  that  are  fully  three  dimensional  (wavy 
or  loopy);  but  in  edge  views  the  wake  still  looks  like  a  “Karman-street”  (Breidenthal,  1980; 
Cimbala,  1984). 

With  increasing  distance  downstream,  the  primary  instability  and  the  secondary  instability 
both  rescale  themselves  as  may  be  seen  in  the  edge  and  plan  views  in  Fig.  3.  In  the  mixing 
layer,  rescaling  of  the  secondary  instability  lags  the  primary;  in  the  wake  they  appear  to 
proceed  together.  This  continuously  rescaled  large  structure  dominates  the  smaller  structures 
which  evolve  from  convectively  developing  chaos  and  from  higher-order  instabilities. 


4.  IMPLICATIONS  AND  APPLICATIONS  OF  COHERENT  STRUCTURE 

The  developing  knowledge  of  coherent  structure  has  been  useful  in  various  aspects  of 
turbulent  shear  flow,  in  ways  that  were  usually  not  accessible  within  the  context  of  traditional 
modelling.  As  indicated  above,  they  have  given  qualitative  and  quantitative  insights  into 
traditional,  measured  correlations  and  spectra.  Other  phenomena,  such  as  intermittency  at  the 
“turbulent  interface”  and  the  mechanics  of  entrainment  have  also  been  illuminated.  In  the 
following  sections,  we  briefly  discuss  other  developments,  in  particular  the  appearance  of  new 
directions  in  modelling. 

4.1  Parametric  Effects  on  Growth  Rates. 

The  interpretation  of  coherent  structure  as  a  manifestation  of  the  underlying  primary  insta¬ 
bility  suggested  a  connection  between  growth  rates  <16/ dx  and  instability  amplification  rates 
based  on  the  mean  velocity  profile.  A  model  of  the  direct  connection  took  some  time  to  work 
out  (see  section  4.4)  but  even  earlier  the  effects  of  various  parameters  could  be  described.  It 
was  noted  for  example  that  density  difference  between  the  two  streams  of  a  mixing  layer  has  an 
effect  on  the  growth  rate,  d6/dx,  which  is  qualitatively  the  same  as  the  effect  on  amplification 
rate  in  linear  stability  theory;  that  is,  lower  density  on  the  high  speed  side  increases  amplifi- 
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cation  rate  and  growth  rate,  respectively;  and  vice  verse.  Using  results  from  linear  stability 
theory,  Brown  (1974)  and  Dimotakis  (1986),  obtained  quantitative  formulas  for  the  effect. 

Similarly,  a  connection  with  stability  theory  was  made  for  the  effect  of  compressibility 
on  mixing  layers  between  high  speed  flows  of  gases.  The  strong  effect  of  compressibility, 
to  decrease  growth  rate,  correlates  with  a  corresponding  effect  on  amplification  rate  given  by 
linear  stability  theory  (Papamoschou,  1986).  When  the  Mach  number  is  defined  in  a  frame 
which  is  convecting  with  the  wave  or,  equivalently,  with  the  coherent  structure  (Bogdan off, 
1982),  it  provides  the  proper  reference  for  evaluating  compressibility  effects  .  The  effects  of 
this  “convective  Mach  number”  on  the  growth  rates  of  mixing  layers  for  a  variety  of  parametric 
combinations  of  velocities  ( U2 ,  U\)  and  gas  properties  (p,  p\,  0,2 ,  a\)  is  shown  in  Fig.  6. 
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Fig.  6.  Effect  of  compressibility  on  turbulent  free  shear  layers.  All  data  normalized 
at  Mc  =  0.  Growth  rates  from  experiment.  Amplification  rates  from  linear 
stability  theory.  (Papamoschou,  1986;  Zhuang  et  al,  1990). 


4.2  Turbulence  Control  and  Sensitivity. 

The  existence  of  coherent  structures  suggested  that  they  could  be  manipulated  and  the 
mean  flow  development  thus  modified,  i.e.  “controlled”  in  the  current  idiom.  An  example  is 
the  substantial  reduction  of  the  drag  of  a  bluff  cylinder  that  can  be  achieved  by  simply  placing 
a  splitter  plate  in  its  wake  to  interfere  with  the  formation  and  shedding  of  vortices  (Roshko, 
1955).  But  the  possibility  for  accomplishing  such  control  in  the  far-field  of  a  turbulent  shear 
layer  is  a  little  more  subtle  because  there  is  no  fixed  scale,  as  discussed  earlier,  and  because 
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of  possible  difference  in  the  nature  of  the  underlying  instability,  “absolute”  cf.  “convective” 
(Huerre  and  Monkewitz,  1985). 

Actually,  control  can  be  exercised  in  several  ways.  One  is  to  change  the  stability  scenario, 
as  with  the  splitter  plate  in  the  near  wake.  Another  is  to  change  the  external  environment,  i.e.  to 
“change”  the  excitation  or  “forcing”  to  which  the  flow  may  be  receptive.  Such  “active  control” 
of  a  turbulent  jet  was  demonstrated  in  the  experiment  by  Crow  and  Champagne  ( 1 97 1 )  in  which 
acoustic  excitation  ai.  the  jet  origin  enhanced  growth  rates  when  suitable  frequencies,  related 
to  the  “jet-mode”  instability  frequency,  were  used.  The  latter  is  analogous  to  the  "Strouhal” 
frequency  of  vortex  shedding  from  a  bluff  body. 

For  the  far  field  of  the  mixing  layer,  where  there  is  no  fixed  scale,  Oster  and  Wygnanski 
(1982)  showed  that,  by  introducing  a  small,  periodic  oscillation  at  the  trailing  edge  of  the  splitter 
plate,  they  could  produce  a  strong  effect  on  the  growth  rate,  dd/dx.  In  a  later  experiment, 
Roberts  (1985)  produced  similar  effects  by  adding  to  the  velocity  of  one  stream  a  periodic 
variation  of  about  1%.  Roughly,  the  effect  is  to  increase  the  growth  rate  in  the  early  part 
of  the  shear  layer  and  then  to  stop  it  further  downstream,  at  a  value  of  x  which  increases 
with  decreasing  forcing  frequency.  After  an  interval  of  inhibited  growth,  the  layer  resumes 
its  far-field  growth  rate.  These  effects  and  their  relation  to  the  shear  layer  parameters  can 
be  understood  as  the  instability  response  of  the  developing  shear  layer  to  periodic  excitation 
(Browand  and  Ho,  1983;  Ho  and  Huerre,  1984)  at  an  amplitude  which,  though  small,  takes 
control  away  from  the  broad  band  of  other,  available  disturbances.  The  periodic  disturbance 
forces  the  appropriate  wave  to  emerge  and  grow  until  d(x)  reaches  a  cut-off  value,  after  which 
the  forcing  tends  to  lock  in  the  wave.  Corresponding  effects  on  Reynolds  stress  and  mixing 
rate  are  quite  spectacular. 

This  sensitivity  to  relatively  small,  imposed  disturbances  raises  fundamental  questions  about 
how  the  turbulent  mixing  layer  sustains  itself,  whether  by  self  excitation  or  by  external  forcing 
from  its  environment.  The  ordinarily  smooth,  linear  mean  growth  rate  indicates  that  the  shear 
layer  is  responding  to  a  broad  band  of  excitation.  That  excitation  could  come  from  disturbances 
in  the  free  stream  (which  ordinarily  have  broad  spectrum)  or  by  Biot-Savart  feedback  to  the 
origin  from  the  developing  turbulent  layer  itself.  It  is  at  the  splitter-plate  trailing  edge,  the  origin 
of  the  shear  layer,  that  the  shear  layer  instability  is  most  receptive  to  excitation.  The  presence 
of  such  feedback  was  noted  by  Dimotakis  and  Brown  (1976);  its  possible  role  in  establishing 
self  excitation  was  studied  by  Kaul  (3988).  It  has  also  been  noted  in  connection  with  effects  of 
“outflow  boundary  conditions”  in  numerical  simulations.  Also  relevant  to  the  question  of  how 
the  turbulence  sustains  itself  is  the  nature  of  the  underlying  instability,  whether  “convective” 
or  "absolute”. 

4.3  Necessity  for  Lagrangian  Viewpoints. 

An  important  property  of  coherent  structures,  implicit  in  the  name,  is  their  coherence  over 
significant  lifetimes.  This  naturally  makes  for  a  Lagrangian  view  of  them.  Such  a  view 
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is  indispensable  in  situations  where  unsteady  data  and  their  correlations  are  needed,  as  in 
meteorology.  But  even  for  problems  in  which  only  the  mean  fluxes  of  a  station^  flow  are 
required,  as  for  the  classical  shear  flows  discussed  above,  it  is  sometimes  advantageous  or 
even  crucial  to  first  describe  structures  in  a  Lagrangian  frame  in  order  to  correctly  describe  the 
processes  leading  to  the  final,  mean  values. 

Such  a  case  is  the  problem  of  determining  the  rate  of  production  of  chemical  reaction 
products  in  turbulent  mixing  of  reactant  fluids,  in  mixing  layers  and  jets.  To  account  for 
some  puzzling  effects  in  such  flows,  and  cognizant  of  the  observed  large,  vortical  entraining 
motions  associated  with  the  coherent  structures,  Broad  well  and  Breidenthal  (1982)  proposed  a 
Lagrangian  model  of  turbulent  mixing  in  which  the  coherent  structures  are  viewed  as  “reactors” 
into  which  the  two  fluids  are  entrained  by  the  large  scale  motions,  then  stirred  down  to  smaller 
scales  and,  finally,  molecularly  and  chemically  mixed.  This  approach  makes  it  possible  to 
rationally  incorporate  aspects  of  chemistry  that  are  sensitive  to  how  the  reactants  are  brought 
together.  In  the  final  result,  mean  values  of  product  fluxes  are  obtained,  but  in  the  Lagrangian 
approach  some  of  the  modelling  is  done  before  averaging.  A  review  of  the  method  and  of 
some  of  its  developments  (Kerstein,  1992)  and  applications  may  be  found  in  a  recent  paper  by 
Broadwell  and  Mungai  (1991). 

Even  in  the  basic  problem  of  modelling  the  Reynolds  stresses  and  the  development  of  the 
mean  flow,  the  limitations  of  the  Eulerian  reference  frame  are  hidden  when  the  shear  flow  is 
an  equilibrium  flow.  In  non  equilibrium  flows,  e.g.  relaxation  from  one  equilibrium  to  another 
(Narasimha  and  Prabhu,  1972),  the  “memory”  implicit  in  the  lifetime  of  coherent  structure  may 
be  important  in  the  mechanics  and  for  modelling. 

4.4  New  Directions  in  Modelling. 

As  stressed  in  the  Introduction,  a  principal  objective  for  a  theory  of  turbulence,  certainly  for 
applied  mechanics,  are  models  which  predict  the  turbulent  fluxes  in  shear  flows,  in  particular 
the  turbulent  stresses.  At  one  extreme  of  possible  approaches  are  the  Re  models,  which  are 
designed  for  broad  applicability  but  contain  empirical  closure  constants.  At  the  other  extreme 
is  direct  numerical  simulation  (DNS),  i.e.  the  solution  by  numerical  computation  of  the  exact, 
unsteady  Navier-Stokes  equations.  Such  realization  of  turbulent  flow  in  time-evolving  detail 
in  a  computer  is  not  really  a  “model”  but  rather  an  alternative  to  experimental  realization  in  the 
laboratory,  an  alternative  which  has  many  attractive  and  useful  features.  It  is  limited  at  present 
in  the  values  of  Reynolds  number  (~  103)  for  which  accurate  simulation  can  be  achieved.  But 
even  when  and  if  it  becomes  possible  to  numerically  simulate  turbulent  flows  at  will,  there 
will  still  be  a  practical  need  and  an  intellectual  incentive  to  seek  the  simplification  and  insight 
provided  by  models. 

Between  the  extremes  of  Re  modelling  and  DNS  is  the  possibility  for  models  which  in 
some  way  incorporate  the  underlying,  driving  instability.  Indeed,  efforts  to  base  turbulence 
models  on  hydrodynamic  instability  existed  even  before  impetus  was  given  to  such  efforts  by 
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the  observations  of  coherent  structure.  Examples  include  the  marginal  instability  theories  of 
Malkus  (1956)  for  turbulent  boundary  layers  and  Lessen  (1974,  1976)  for  free  shear  flows; 
and  the  wave-guide  theory  of  Landhal  ( 1 967)  for  turbulent  boundary  layers.  Theories  which 
more  explicitly  sought  to  derive  the  production  of  Reynolds  stress  from  the  instability  include 
those  of  Tam  and  Chen  (1979);  Zhang  and  Lilly  (1981);  and  Liu  (1988).  Insights  have  also 
been  obtained  from  related  efforts  to  model  acoustic  noise  radiation  from  turbulent  jets,  by 
associating  the  source  of  the  radiation  with  the  instability  waves  in  the  jet  free  shear  layers 
(Mollo-Christensen,  1967;  Liu,  1974;  Ffowcs  Williams,  and  Kempton,  1978;  Tam  and  Morris, 
1 980;  Tam  and  Burton,  1984). 

The  present  status  of  these  modelling  approaches  is  well  represented  in  the  recent  papers 
of  Morris,  Giridhavan  and  Lilley  (1990)  and  Liou  and  Morris  (1992),  who  have  constructed 
models,  without  empirical  constants,  for  the  turbulent  development  of  free  shear  layers.  Explicit 
use  is  made  of  the  Rayleigh  stability  equations,  which  are  for  inviscid  flow. 
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for  the  eigenfunction  </>  of  velocity  fluctuations.  The  distribution  and  evolution  of  u',  v'  and 
thus  of  the  Reynolds  stress  are  computed,  in  coupled  interaction  with  the  developing  mean  flow 
equations  (1M3),  described  earlier.  In  Eq.  (5),  a  and  ft  are  streamwise  and  spanwise  complex 
wave  numbers,  respectively,  u;  is  the  complex  frequency  and  u(y)  is  the  mean  velocity  profile. 
(Basically,  the  distribution  of  the  mean  vorticity  dtt/dy  determines  the  instability.)  Morris  et 
al.  explore  several  methods  for  coupling  the  equations.  The  simplest  of  these  is  an  integral 
method,  in  which  a  shape  function  is  assumed  for  u(y).  A  set  of  coupled  equations  is  obtained 
for  the  shear  layer  thickness  6(x)  and  the  fluctuation  amplitude  A{x).  For  each  wave  mode 
A(u>,  P),  the  equations  have  the  form 
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where  (~a,),  the  imaginary  part  of  a,  is  the  amplification  factor  and  C  is  a  constant  that 
depends,  for  each  shear  layer,  on  the  velocity  ratio  U2/U1  and  on  the  shape  function  assumed 
for  u(y).  If  the  shear  layer  contains  more  than  one  wave  mode  the  right  hand  side  of  Eq.  (7) 
becomes  a  summation,  or  integration,  over  the  spectrum  of  wave  numbers.  The  computation 
of  the  flow  development  is  initiated  at  x  =  0  (the  “trailing  edge”)  with  an  initial  spectrum 
>1(0;  u>,  (3)  of  small  amplitude,  0.01.  Results  for  broad  band  as  well  as  discrete  spectra  are 
obtained  by  Morris  et  al. 


Equation  (6)  is  derived  from  the  energy  equation  (3)  for  the  turbulent  energy  production,  that 
is  the  wave  energy  production.  At  this  point,  modelling  assumptions  are  introduced.  The  direct 
viscous  dissipation  and  the  energy  transfer  terms,  except  p'u',  are  omitted.  The  justifications, 
in  particular  for  the  treatment  of  the  energy  transfer  from  large  to  small  scales,  are  based  on 
insights  from  experiment  (e.g.  Ho  and  Huang,  1982)  and  on  the  earlier  models.  The  result  is  a 
model  with  no  empirical  constants. 

The  values  for  growth  rate  dfi^/dx  computed  from  the  model  agree  well  with  experimental 
data  (Fig.  7).  Results  from  a  more  genera!  vision  of  the  model  than  that  outlined  above,  which 
allows  for  effects  of  density  ratio  or  of  compressibility,  are  also  obtained.  Comparison  with 
available  experimental  data  for  the  effect  of  compressibility  is  also  good.  For  example,  it  is 
found  that  spanwise  instability,  represented  by  wave  numbers  (3,  begins  to  play  a  significant 
role  only  at  higher  values  of  convective  Mach  number  (~  0.5),  as  had  been  found  by  Sandham 
and  Reynolds  (1990)  in  direct  numerical  simulations. 
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Fig.  7.  Growth  rate  of  free  shear  layer;  dependence  on  velocity  difference. 

O  model  of  Morris  et  al  (1990);  O  A  V  x  experimental  points  com¬ 
piled  by  Brown  and  Roshko  (1972). 

5.  SUMMARY  AND  CONCLUDING  REMARKS 

The  evidence  that  turbulent  shear  flows  contain  organized  structure,  and  the  idea  that  they 
are  related  to  hydrodynamic  instability  in  the  mean  turbulent  flow,  have  been  around  for  a 
long  time,  cf.  the  reviews  by  Liepmann  (1952,  1962).  Only  recently  have  they  fully  come 
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together  in  the  view  that  the  primary  instability  due  to  the  mean  global  vorticity  distribution  is 
the  driving  mechanism  for  the  developing  turbulent  flow.  The  genesis  of  the  turbulence  is  in 
that  instability.  Insightful  theories  should  include  that  fundamental  mechanism  as  the  one  from 
which  other  aspects  of  the  turbulence  derive  and  to  which  they  relate:  the  indeterminancy  and 
statistical  nature  of  the  coherent  structures;  the  development  of  smaller  and  larger  structure; 
and  so  on.  Inputs  from  the  theories  of  dynamical  systems,  chaos  and  fractals  should  be  useful 
in  illuminating  various  aspects  of  such  theories  for  turbulent  shear  flow. 

The  evidence  fcr  the  views  presented  is  most  obvious  in  the  free  shear  layer,  where  the 
primary  instability  creates,  basically,  a  row  of  vortex  structures  all  of  one  sign,  and  it  is  not 
surprising  that  a  successful  model  has  been  worked  out  first  for  that  case.  In  plane  wakes  the 
basic  configuration  is  two  rows  of  structures  of  opposite  sign,  and  the  same  is  true  for  plane  jets, 
which  have  not  been  explored  as  extensively.  For  axisymmetric  wakes  and  jets,  the  coherent 
structure  is  more  difficult  to  discern  in  the  far  field,  because  probablv  the  primary  instabilities 
are  helical.  The  existence  of  large  structure  is  not  apparent  in  the  far-field  picture  in  Fig.  1  but 
it  has  been  educed  for  similar  large  jets  from  computer  enhanced  video  movies  (Mungal  et  al, 
1989, 1992).  Morris  has  applied  his  model  to  the  near  field  of  round  jets,  i.e.  to  the  annular  free 
shear  layer  (Viswanathan  and  Morris,  1992).  The  far  field  is  still  to  be  addressed  but,  in  time, 
the  joining  of  instability  theory  with  the  mean  flow  equations  will  no  doubt  be  extended  to  this 
and  other,  canonical  shear  flows.  All  these  developments  are  still  in  the  exploratory  stage. 
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Abstract 

The  characteristic  property  of  the  phenomena  studied  by 
micromechanics  is  the  governing  influence  of  the  variations  of  the 
material  microstructure  on  the  macroscopic  behavior  of  bodies.  In  the 
mathematical  models  of  such  phenomena,  the  macroscopic  equations  of 
mechanics  and  the  kinetic  eqrations  of  the  microstructural 
transformations  form  a  unified  set  that  should  be  solved  simultaneously. 

Fracture  occurs  in  the  zones  of  high  stress  concentration  where  the 
phenomenon  is  often  complicated  by  phase  transformations,  chemical 
transformations,  and  heat  generation.  This  makes  the  application  of  the 
micromechanical  approach  to  fracture  phenomenon  very  natural. 

At  the  beginning  of  the  present  lecture,  a  concise  general  outline  of 
micromechanics  is  given.  Applications  of  scaling,  intermediate 
asymptotics,  and  renormalization  groups  to  the  micromechanics  of 
fracture  are  discussed.  Subsequently,  an  example  of  the  application  of 
the  micromechanical  approach  to  the  study  of  damage  accumulation  is 
considered. 


1.  INTRODUCTION 

Monsieur  le  President,  Ladies  and  Gentlemen:  First  of  all  I  want  to 
thank  the  Organizers  of  the  Congress  collectively  and  individually  for  the 
honour  rendered  to  me  by  the  Invitation  to  present  this  lecture. 

The  construction  of  continuum  mechanics,  *  the  general  mathematical 
model  of  motions  and/or  equilibrium  of  real  deformable  bodies  usually 
proceeds  in  the  following  way.  After  proper  definition  of  the  basic 
concepts  such  as  the  observer,  continuous  medium,  etc.,  the  general 
covariance  principle  and  conservation  laws  are  introduced.  According  to 
the  covariance  principle,  the  laws  of  motion  of  the  continuous  media 
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should  be  expressed  via  the  equations  equally  valid  for  all  observers.  This 
principle  of  equality  of  all  observers  claims  the  invariant  nature  of  the 
medium  and  its  motions  and  therefore  greatly  simplifies  the  analysis. 
Conservation  laws  express  the  general  fact  that  matter,  momentum, 
angular  momentum,  energy,  etc.,  do  not  arise  spontaneously  and  do  not 
disappear  by  themselves.  In  combination  with  the  covariance  principle, 
the  conservation  laws  lead  to  such  concepts  as  flux  vector,  stress  tensor, 
energy  flux,  couple  stresses,  etc. 

Conservation  laws  and  the  covariance  principle  taken  alone  are 
insufficient  for  designing  the  mathematical  models  of  the  motions  of  real 
bodies.  To  achieve  this  goal,  it  is  necessary  to  provide  the  continuum  with 
physical  properties.  This  means  that  a  certain  apriori  model  of  the 
continuum  is  proposed  which  is  adequate  for  the  class  of  motions  and 
bodies  under  consideration,  but  is  generally  valid  only  for  restricted 
classes  of  the  motions  of  real  bodies.  Such  models,  the  so-called 
constitutive  equations,  are  represented  by  certain  relations  between  the 
properties  of  the  motion  (or  the  state)  of  the  continuous  medium  and  the 
active  internal  forces. 

Simplest  examples  are  the  classical  models  of  continuous  media  such 
as  Hooke's  elastic  solid.  This  model  for  an  ideally  linear  isotropic  elastic 
body  is  characterized  by  three  constants:  density  p,  and  two  elastic 
constants,  e.g.  Young  modulus  E  and  Poisson’s  ratio  v.  For  obtaining  the 
mathematical  model  for  the  brittle  fracture  phenomenon,  Hooke’s  model 
should  be  extended  since  fracture  phenomena  do  not  exist  in  the 
classical  linear  elasticity  problem  statement.  Extension  of  the  model  of  an 
ideal  elastic  solid  for  bodies  with  cracks  assumes  that  the  body  is 
perfectly  linearly  elastic  outside  infinitely  thin  cracks.  However,  cohesive 
forces  are  acting  on  the  crack  surfaces  near  the  edges  of  the  cracks.  The 
distribution  of  the  cohesive  forces  over  the  crack  surface  in  the  mobile 
equilibrium  state  is  autonomous,  i.e.  independent  of  the  form  and 
position  of  the  outside  edge  regions  of  the  cracks.  In  this  extended 
model,  an  additional  constant  appears:  fracture  toughness  or  cohesive 
modulus  Kq,  which  is  an  integral  characteristic  of  the  cohesive  forces. 

It  as  assumed  in  using  classical  models  that  the  constants  entering  the 
constitutive  equations  are  universal  ones.  This  means  that  these 
constants,  being  determined  from  some  special  experiments,  preserve 
their  values  for  arbitrary  motions  described  by  the  considered  model. 

An  approach  similar  to  the  classical  encounters  essential  difficulties 
when  applied  to  many  new  materials  entering  modern  research  and  high 
technology,  or  even  traditional  structural  materials  under  extreme 
conditions  characteristic  of  fracture  regions.  Researchers  attempted  at 
first  to  use  modified  classical  models,  especially  for  new  materials.  So, 
three,  four,  and  even  eight  model  constants  appeared.  This  procedure, 
however,  is  unreliable.  It  is  not  only  the  growing  complexity  of  the 
determination  of  new  constants  that  arises,  nor  even  the  loss  of  a  clear 
physical  sense  of  these  constants.  Much  worse  is  that  these  constants,  so- 
to-say,  "cease  to  be  constants",  i.e.  the  range  of  universality  where  the 
constants  of  the  model  can  be  considered  as  genuinely  universal  ones 
becomes  so  narrow  that  the  models  lose  their  predictive  capability. 

The  reason  for  that  is  well  known.  Every  material  has  its  own 
relaxation  time  x,  i.e.  a  characteristic  time  when  the  shear  stresses  are 


preserved  under  imposed  fixed  shear  strain.  In  turn,  every  process 
introduces  its  own  characteristic  time  t.  Therefore,  for  each  process  in 
each  material  a  governing  dimensionless  parameter  appears,  the  so- 
called  Deborah  number, 

De=x/T  (1.1) 

For  "fluid-like"  behavior  of  a  material  in  a  process,  De  should  be  De«l, 
while  for  "solid-like"  behavior,  De»l. 

We  now  note  that  the  normal  duration  of  a  human  experiment 
encompasses  about  18  decimal  orders  of  magnitude:  from  Ins  =  10'9s,  to 
109s  ~  30  years,  so  that  10‘9s<T<109s  is  the  range  of  the  usual  human 
experiments.  For  water  at  normal  conditions,  x~1012s,  while  for  steel  at 
normal  temperatures  and  not  too  large  stresses,  x~1012s.  Thus  for  the  full 
range  of  human  experiments  at  normal  conditions,  De«l  for  water,  and 
De»l  for  steel.  But  that  does  not  apply  to  steel  near  the  edge  of  an 
extending  crack  where,  due  to  the  stress  concentration  and/or  heat 
generation,  x  is  strongly  reduced.  It  also  does  not  apply  for  well  known 
synthetic  materials  such  as  silly  putty,  for  which  x~ls  under  normal 
conditions.  That  material  demonstrates  all  types  of  behavior  ranging  from 
perfectly  elastic  and  brittle  when  it  is  struck  or  torn  quickly,  to  perfectly 
viscous  when  it  takes  the  form  of  a  cup  where  it  is  placed  for  a  period  of 
less  than  an  hour. 

For  describing  such  mixed  behavior,  an  approach  was  advanced  which 
seemed  at  first  glance  attractive:  spatial  and  temporal  non-locality  was 
introduced  to  the  models.  It  meant  practically  that  functionals  were 
introduced  into  the  constitutive  equations.  For  instance,  for  the  so-called 
"simple  body", 

a  =  u(F(0))  (1.2) 

(Here  o  is  the  stress  in  a  given  particle,  F(0)  the  strain  gradient,  u  the 

constitutive  functional,  and  0  is  the  time  over  the  whole  range  from  the 
beginning  of  deformation  to  the  actual  current  time  t:  -°°<0<t). 

Without  going  into  details,  it  is  noted  that  the  appearance  of 
functionals  in  constitutive  equations  always  means  that  the  approach  is 
insufficient.  To  demonstrate,  let  us  consider  the  simplest  example:  gas 
flow  in  a  tube  with  heat  conducting  walls.  Imagine  an  obviously 
nonsensical  case:  we  do  not  wish  to  introduce  temperature  as  a 
consideration.  Evidently  the  density  of  a  certain  particle  will  become  a 
functional  of  pressure.  It  is  not  necessary  to  explain  that  in  gas  dynamics 
since  another  way  is  used  (although  not  in  the  mechanics  of  polymers). 
The  temperature  is  introduced  into  consideration  and  an  energy  equation 
is  added  to  the  mathematical  model.  The  temperature,  however,  is 
nothing  but  the  mean  energy  of  microscopic  molecular  motion,  and  the 
energy  equation  can  be  interpreted  as  a  kinetic  equation  for  this  directly 
observable  characteristic  of  the  microstructure.  Introduction  of  the 
temperature  simplifies  everything  since  at  least  the  functionals  disappear 
from  consideration. 
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2.  MICROMECHANICS 

In  the  last  decades,  and  most  intensively  in  recent  years,  an 
alternative  approach  has  appeared  more  and  more  frequently  in  research 
practice.  According  to  this  approach,  the  properties  of  the  material 
microstructure,  directly  or  indirectly  observable,  are  explicitly 
introduced  into  consideration.  The  equations  of  macroscopic  motions  and 
those  of  the  kinetics  of  microstructural  transformation  are  considered 
simultaneously. 

In  our  opinion  this  is  the  very  subject  of  micromechanics.  Thus,  in  our 
understanding,  micromechanics  is  the  branch  of  mechanics  studying  the 
phenomena  for  which  the  variations  of  microstructure  are  of  governing 
influence  for  the  macroscopic  behavior  of  bodies. 

If  the  time  scales  of  the  processes  under  consideration  are  such  that 
the  variations  of  the  microstructure  can  be  considered  either  as 
instantaneous  or  as  negligible,  a  classical  approach  neglecting 
microstructural  variations  and  using  finite  constitutive  equations  - 
integrals  of  the  kinetic  equations  -  can  be  applied. 

Concerning  the  very  term  "micromechanics”,  often,  and  in  particular 
in  the  works  of  Professor  J.  Achenbach,  Professor  Z.  Hashin,  Professor  V. 
Entov  and  other  scientists,  this  term  is  used  in  a  more  narrow  sense 
denoting  the  technique  for  determining  macroscopic  properties  of 
bodies  on  the  basis  of  certain  models  of  their  microstructure.  From  our 
viewpoint,  this  is  only  a  certain  part  of  the  subject  of  micromechanics 
and  we  prefer  the  wider  definition  given  above. 

We  do  not  have  time  and  place  here  for  including  a  comprehensive 
historical  analysis  of  the  origin  and  formation  of  micromechanics.  It 
seems  necessary,  however,  to  mention  several  milestone  works  which  are 
doubtless  related  to  micromechanics  in  the  sense  just  mentioned.  At  the 
same  time,  they  belong  to  researchers  whose  reputation  in  mechanics  is 
undisputable  -  they  are  generally  accepted  as  outstanding  representatives 
of  the  mechanical  community.  The  last  point  is  specially  important:  the 
mechanical  community  is  rather  conservative  -  in  the  good  sense  of  the 
word  -  in  matters  that  concern  the  subject  of  mechanical  research. 
Therefore,  the  example  offered  by  outstanding  people  within  the 
mechanical  community  is  important:  to  be  modem  in  mechanics,  the 
researchers  should  know  micromechanics.  The  first  to  be  mentioned  is 
Th.  von  Karman,  one  of  the  great  founders  of  IUTAM  and  ICTAM's.  In  a 
series  of  papers,  summarized  to  some  extent  in  his  Maryland  lecture 
entitled  "Aerothermochemistry"  [1),  Th.  von  Karman  formulated  the 
general  problem  of  aerothermochemistry  as  the  problem  of  fluid 
dynamics  with  chemical  transformations  accompanied  by  heat  generation. 
For  the  transformation  rate  W  of  an  active  component  of  the  mixture,  he 
used  semi-empirical  formulae  of  the  Arrhenius  type: 

W~(  1  -a)pexp(-  U/kT)  (2.1) 

Here  a  is  the  mass  concentration  of  an  active  component  under 
consideration,  U  -  a  constant  called  the  activation  energy,  k  -  the 
Boltzmann  constant,  p  -  "reaction  order",  -  another  constant.  The  heat 
generation  rate  is  assumed  to  be  proportional  to  the  reaction  rate. 


It  is  instructive  that  Th.  von  Karman  did  not  intend  to  invent  a  new 
branch  of  science  motivated  only  by  the  internal  needs  of  science.  Just 
the  opposite:  he  was  one  of  the  first  who  understood,  for  example,  that 
the  problem  of  space  vehicle  reentry  to  the  Earth's  atmosphere  cannot  be 
solved  without  proper  consideration  of  the  chemical  transformation  and 
dissociation  of  air.  Even  clearer  for  him  was  the  necessity  to  create  this 
new  branch  of  mechanical  science  in  his  attempts  to  design  a  rational 
theory  of  liquid  fuel  combustion  in  jet  engines. 

In  the  general  lecture  on  the  XIV  ICTAM  (2)  devoted  to 
"Micro hydrodynamics",  G.K.  Batchelor  summarized  a  series  of  works 
concerning  micromotions  of  small  bodies  in  fluids.  Subsequent  papers  of 
G.K.  Batchelor  and  his  students,  especially  E.J.  Hinch,  applied  the  results 
and  approaches  of  microhydrodynamical  studies  to  create  the  modern 
hydrodynamics  of  suspensions. 

In  his  papers  [3,4]  entitled  "Micromechanics",  B.  Budiansky 
emphasized  the  growing  attention  of  the  mechanical  community  to  "the 
mechanics  of  very  small  things".  He  considered  several  problems, 
seemingly  special  but  in  fact  very  important  from  the  general  viewpoint  of 
the  micromechanics  of  solids,  such  as  transformation  -  toughening  due  to 
phase  changes  and  particulate  toughening. 

It  is  instructive  to  look  at  the  photographs  of  microstructural  pictures 
from  the  mentioned  papers  by  G.K.  Batchelor  [2]  and  by  B.  Budiansky  [3] 
to  better  represent  the  objects  which  have  recently  entered  the  fields  of 
interest  of  the  mechanical  community. 

B.  Budiansky  writes  [4]:  "Micromechanics  is  the  currently  fashionable 
designation  of  what  is  really  an  old  subject,  but  one  that  is  receiving 
increasing  attention  from  theoreticians  in  applied  mechanics.  Armed 
with  their  repertoire  of  analytical  tools,  they  try  to  relate  the  overall 
deformation  and  strength  properties  of  the  materials  to  the  behaviors  and 
interactions  of  their  microscopic  constituents".  Try  to  relate  does  not 
always  mean  the  use  of  finite  equations:  they  can  be  related  by  differential 
equations,  and  that  is  exactly  the  definition  used  here.  A  natural  question 
arises:  whether  the  coining  of  a  new  term  is  important  and  justified, 
thereby  legitimizing  a  new  branch  of  continuum  mechanics  and  giving  it  a 
status  of  certain  independence? 

I  suppose,  yet,  it  is.  The  examples  of  the  theory  of  oscillations  which 
considers  oscillations  in  a  way  irrelevant  to  their  physical  nature, 
functional  analysis,  cybernetics,  and  to  a  lesser  extent  cynergetics, 
confirm  it.  A  general  approach  as  well  as  a  unified  style  of  the  analysis  of 
new  phenomena  appears,  which,  to  a  certain  extent,  leads  to  a  new 
general  ideology.  Seemingly  uncoordinated  results  appear  in  unified  form. 
The  transfer  of  results  from  one  subject  to  another  becomes  possible  as 
well  as  the  prediction  of  results  based  on  the  previous  experience. 
Therefore  the  legitimization  of  micromechanics  as  a  unified  new  branch 
of  continuum  mechanics  seems  to  be  well  deserved  and  expedient. 

For  a  more  or  less  detailed  further  presentation,  1  had  to  make  a 
choice.  So,  I  selected  the  application  of  similarity,  more  exactly  scaling 
(renormalization  group)  methods,  in  the  micromechanics  of  fracture  and 
the  micromechanical  analysis  of  damage  accumulation. 


3.  APPLICATION  OP  SIMILARITY  METHODS  IN  MICROMECHANICS 
OF  FRACTURE 

A.  The  systems  of  equations  of  micromechanical  models  are  usually 
complicated,  and  analytical  solutions  of  such  sets  may  be  obtained  only  as 
rare  exceptions.  Even  obtaining  numerical  solutions  involves  serious 
problems.  Therefore,  the  asymptotic  methods  and,  in  particular,  simi¬ 
larity  methods  play  an  important  role  in  the  micromechanics  of  fracture. 

The  application  of  similarity  methods  in  fracture  mechanics  has  a 
long-time  established  tradition.  Here  we  will  speak  about  scaling  (or 
similarity  laws  of  the  second  type)  which  are  closely  connected  with  the 
renormalization  group:  a  concept,  very  fashionable  now  in  theoretical 
physics.  We  illustrate  it  firstly  on  a  simple  geometrical  example.  Consider 
two  continuous  curves  of  diameter  D.  One  of  them  is  a  normal  circle.  We 
inscribe  a  regular  n-gon  with  side  length  r\  in  it.  For  the  length  of  the 
perimeter  of  the  polygon,  L^,  dimensional  analysis  gives 

Lp  =  D<J>(q/D)  (3.1) 

For  sufficiently  small  q/D,  the  function  <I>  is  arbitrarily  close  to  its  limit  n, 
so  asymptotically  Lp=jrD.  It  is  trivial  that  this  asymptotic  is  invariant  with 
respect  to  a  transformation  group 

Lp  =  Lyp  D'=  D,  q’  =  Xq  (3.2) 

(X<lis  a  group  parameter).  We  refer  to  this  case  as  the  similarity  of  the 
first  type,  or  complete  similarity. 

The  second  curve  is  a  Mandelbrot  fractal,  for  instance  von  Koch  curve 
(5,6].  (It  was  B.  Mandelbrot  who  coined  the  very  term  "fractal"  and  was 
able  in  his  excellent  monographs-essays  (5,6)  to  show  systematically  the 
importance  of  such  objects  which  entered  mathematics  and  physics  early 
in  this  century  for  modern  natural  philosophy  in  the  whole.)  Then,  the 

asymptotic  of  the  function  O  in  (3.1)  at  small  q/D  is  d>~C(q/D)'a,  where 
a=Const,  0<a<l,  so 

Lp=  Const(D1+a/qa)  (3.3) 

It  is  clear,  that  at  q/D->0  there  is  no  finite  limit,  Lp— ><».  Meanwhile 

Lpqa  tends  to  a  finite  limit,  and  the  asymptotic  (3.3)  is  invariant  with 
respect  to  the  transformation  group 

t 

Lp=  X-aLp,  D’  =  D,  q'  =  Xq  (3.4) 

-  the  simplest  example  of  the  renormalization  group  mentioned  above. 
The  number  m=l+a  is  called  the  fractal  dimension  of  the  curve  under 
consideration.  We  refer  to  such  a  case  as  scaling,  similarity  of  the  second 
type,  or  incomplete  similarity. 
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The  natural  step  in  micromechanics  of  fracture  proposed  by  B. 
Mandelbrot  and  his  colleagues  [7]  was  to  investigate  the  fracture  surface 
of  metals  -  whether  the  fracture  surface  is  fractal  and,  if  so,  what 
information  can  be  extracted  from  its  fractal  dimensions? 

An  excellent  result  of  the  perfect  fractality  of  fracture  surfaces  was 
obtained  by  V.K.  Horvath  and  H.J.  Herrmann  (8)  for  stress-corrosion 
cracks  (Fig.  1).  More  typical  and  natural  are  the  results  demonstrated  by 
R.H.  Danskardt,  F.  Haubensack  and  R.O.  Ritchie  (9J:  there  exist  several 
intervals  of  fractality  between  the  natural  material  length  scales.  The  non¬ 
trivial  question  is  how  the  fractal  dimension  of  the  fracture  surface  is 
correlated  to  fracture  toughness  of  the  material  or.  in  other  words,  how 
to  use  these  results  for  quantitative  strength  calculations?  Although  this 
question  has  no  definite  answer  yet,  it  is  a  good  problem. 

The  application  of  the  similarity  approach  combined  with 
micromechanical  analysis  to  fatigue  crack  studies  is  instructive.  The 
classical  result  in  fatigue  fracture  is  the  scaling  law  by  Paris  and  Erdogan 
[10)  for  crack  propagation  under  multi-cycle  fatigue: 

df/dn  =  A(AK)m  (3.5) 

Here  df/dn  is  the  fatigue  crack  velocity  per  cycle  averaged  over  the 
cycle  and  AK  is  the  stress- intensity  factor  amplitude.  An  example  of  such 
a  "kinetic  diagram"  is  presented  in  Fig.  2. 

Let  us  consider  [1 1-131  the  kinetic  diagram  (3.5)  from  the  viewpoint 
of  the  similarity  approach.  The  average  velocity  df/dn  can  depend  on  the 
following  quantities:  AK,  R=Kmln/Kmax  -  the  asymmetry  of  loading,  h  -  the 
characteristic  specimen  size,  f  -  frequency,  KIC  -  standard  fracture 
toughness,  g  -  yield  stress,  t  -  time.  Dimensional  analysis  gives 

d*/dn  =(AK/oy)2<D(AK/KIC,R,z,ft) 
where 

z  =  oy  Vh/KIC 

is  the  basic  similarity  parameter.  An  asymptotic  stage  is  considered 
where  the  influence  of  the  argument  ft  disappears.  The  Intermediate- 
asymptotic  character  of  fatigue  crack  propagation  is  clearly  confirmed  by 
analysis  of  the  fracture  surface:  a  regular  system  of  striations  appears  at 
this  stage.  Moreover,  the  parameter  AK/KIC  is  small.  Then,  two 
possibilities  appear:  complete  similarity  when  the  limit  of  O  at  AK/KIC— >0 
is  finite  or  not.  If  it  is  finite,  we  would  obtain  the  power  law  (3.5)  with 
m=2  which  is  practically  never  the  case.  Let  us  assume,  however,  that  the 
incomplete  similarity  (similarity  of  the  second  kind  [14.15))  takes  place 
at  the  intermediate  asymptotic  stage  of  fatigue  crack  extension,  when 

<D  =  (AK/KIC)a<t>1(R,z)  (3.8) 

and  a,  in  principle,  should  also  depend  on  the  similarity  parameter  z. 


(3.6) 

(3.7) 
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Figure  1.  The  straight  line  on  the  log-log  plot  of  the  number  of  "filled" 
boxes  of  the  net;  covering  the  section  of  the  cracked  body  against  the  box 
size  demonstrates  the  fractal  character  of  the  stress  corrosion  cracks  in 
Inconel  600  in  high  temperature  water  (after  [8)).  (The  circles  and 
crosses  denote  various  methods  of  data  processing). 


Substituting  (3.8)  to  (3.6)  we  obtain  exactly  the  scaling  law  (3.5)  with  the 
result  that  m  depends  on  z,  i.e.  the  specimen  size!  Experiments  (see 
Fig.  3)  have  shown  that  this  dependence  is  often  very  strong,  so  the 
designers  should  be  careful  in  using  the  results  of  the  experiments  with 
small  specimens  in  predicting  the  life-time  of  large  structures. 

Two  important  notes  should  be  added  here.  In  fact,  in  the  middle  of 
the  kinetic  diagram  the  mechanism  of  fracture  starts  to  change;  traces  of 
static  fracture  modes  such  as  dimples  appear  on  the  fracture  surface.  It  is 
connected  quantitatively  with  reaching  some  critical  stress  intensity 

factor  Kqy  equal  in  order  of  magnitude  to  oyV  d,  where  d  is  the 

microstructural  length-size,  e.g.  the  grain-size.  Therefore,  the  Paris- 
Erdogan  kinetic  curve  apparently  consists  of  two  sections  corresponding 
to  AK»KGy  and  AK<<KGY.This  was  not  properly  identified  apparently 
because  the  values  of  m  corresponding  to  the  two  branches  of  the 
diagram  were  sufficiently  close. 

The  second  note  is  that  sometimes  (it  was  m  ticed  with  the  aluminium 
alloys  and  required  careful  investigation  with  good  statistics)  the  kinetic 
diagrams  are  not  single-valued  curves  (Fig.  4),  and  there  exists  a  rather 
large  overlapping  range  (see  (131).  Fracture  surface  analysis  shows  that 
the  upper  and  lower  branches  of  this  non-single-valued  kinetic  curve 
correspond  to  different  fracture  mechanisms.  Apparently  these  curves 
explain  the  remarkable  effect  noticed  by  Forsyth  (161  -  the  alternating  of 
smooth  and  rough  regions  at  the  surface  of  fatigue  cracks. 

B.  The  modern  tendency  of  using  high  strength  materials  has 
advanced  the  problems  of  multiple  fracture.  The  statistical  approach 
cannot  be  avoided  here,  although  it  is  not  phenomenological  statistics  in 
the  Weibull  spirit  of  the  late  thirties.  A  combination  of  the  statistical 
approach  with  modem  fracture  mechanics  is  needed  here. 

In  seme  conceptual  aspect,  an  analogy  with  developed  turbulence  was 
found  to  be  fruitful.  In  the  turbulence  phenomenon  we  have  the  fluid 
instead  of  the  deformable  solid,  the  vortices  instead  of  cracks  and 
defects.  The  turbulent  flow  contains  a  cascade  of  interacting  vortices  of 
various  length  scales  The  motion  of  a  certain  small  number  of  vortices  in 
the  fluid  can  be  studied  more  or  less  effectively  as  well  as  the  evolution  of 
a  small  number  of  cracks  or  defects  in  a  solid  under  varying  load  or 
strain.  Nothing  can  be  performed  effectively  if  there  are  multiple  vortices 
in  a  fluid  flow  or  multiple  defects  or  cracks  in  a  deformable  solid.  So, 
everything  really  achieved  in  turbulence,  and  the  famous  Kolmogorov- 
Obukhov  theory  first  and  foremost,  is  based  on  the  statistical  approach 
and  similarity  principles  for  the  vortex  cascade.  In  the  studies  of  multiple 
fracture  this  approach  is  also  followed. 

We  assume,  therefore,  the  physical  scheme  of  multiple  fracture  in  the 
following  way.  There  exists  in  a  deformable  solid  a  cascade  of  crack-like 
defects,  pores,  vacancies,  dislocations,  etc.  which  are  interacting.  The 
interaction  of  the  defects  could  be,  in  principle,  of  two  kinds:  defects  can 
either  strengthen  of  weaken  each  other.  Furthermore,  the  stress 
concentration  near  the  tips  of  large  defects  stimulates  the  generation  of 
new  small  defects.  On  the  other  side,  small  defects  stimulate  the 
extension  and  coalescence  of  large  ones. 


Figure  3.  The  dependence  of  the  exponent  in  the  Paris-Erdogan  law  (3.5) 
on  the  similarity  parameter  z  for  the  steel  4340  for  specimens  of  various 
orientation  (A,L,T)  with  respect  to  the  rolling  direction:  A(O),  L(#).  T(x) 
([111  on  the  basis  of  data  [151). 
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Figure  4.  Kinetic  diagram  of  fatigue  fracture  of  Aluminium  alloy  B95  T1 
for  notched  specimens.  Two  branches  of  the  diagram  with  a  large 
overlapping  region  are  clearly  seen  ([13]). 


The  basic  hypothesis  of  our  approach,  the  similarity  principle  |18j  is 
that  in  multiple  fracture  the  process  of  the  development  of  the  cascade  of 
defects  is  statistically  self-similar.  This  principle  means  that  in  the 
development  of  the  multiple  fracture  process,  only  the  dimensional 
parameters  vary  with  a  statistically  averaged  defect  size  lsc  and  mean 
distance  between  the  defects  Lsc.  The  dimensionless  statistical 
properties  of  the  cascade  remain  invariable. 

In  particular,  the  distribution  curve  of  the  defects  size  exists  for  a 
cascade.  Self- similarity  means  that  when  damage  accumulation  is 
proceeding,  the  form  of  this  curve  remains  fixed  and  only  the  maximum 
frequency  Nsc  and  the  corresponding  mean  defect  size  isc  are  varying. 

Therefore  the  size  distribution  can  be  represented  in  the  following 
universal  form: 

N/Nsc  =  <W<//sc>  (3.9) 

Three  basic  questions  appear  in  connection  with  this  similarity 
principle: 

I.  Does  self- similarity  of  the  defects  cascade  evolution  exist  for  the 
whole  damage  accumulation  process  or  at  least  for  a  certain  part  of  it? 

II.  If  self-similarity  is  only  observed  on  certain  stages  of  the  multiple 
fracture  process,  what  is  the  reason  of  its  loss  and  of  the  transition  from 
one  self- similar  stage  to  the  next  one? 

III.  In  general,  does  self-similarity  of  the  cascade  evolution  have  some 
practical  and/or  theoretical  interest? 

Let  us  answer  these  questions  successively. 

I.  The  data  processing  of  the  defects  cascade  evolution  in  damage 
accumulation  has  shown  that  the  similarity  principle  does  not  contradict 
at  least  some  parts  of  the  fracture  process  in  creep,  fatigue,  and  static 
tension  for  materials  of  various  classes.  So,  the  pore  size  distribution  for 
iron  at  various  strain  levels  coincides  upon  being  plotted  in  universal 
coordinates  (Fig.  5). 

Thereby,  the  pore  size  distributions  at  various  stages  of  the  damage 
accumulation  process  can  be  obtained,  one  from  another,  by  a  similarity 
transformation.  Only  the  parameters  Nsc,  tsc  are  found  to  be  time- 

dependent.  Moreover,  the  universal  size  distribution  curves  of  the  defects 
obtained  in  the  same  way  for  different  materials  tested  in  creep,  fatigue, 
or  static  tension  turn  out  to  be  rather  close  in  spite  of  the  differences  in 
material  properties,  as  well  as  in  the  loading  conditions,  and  especially  in 
the  length  scales  -  from  the  atomic  level  to  blocks  of  rocks  the  size  of 
tens  of  miles  (Fig.  6).  This  apparently  means  that  there  exists  a  certain 
universality  in  the  damage  accumulation  process,  at  least  for  a  sufficiently 
wide  class  of  materials. 

II.  The  multiple  fracture  process  is  multistage  and  multiscale. 
Apparently,  transitions  occur  from  one  self-similar  stage  to  the  next  one. 
The  loss  of  self-similarity  seems  to  be  related  to  the  loss  of  stability  of 
cascade  evolution  at  a  certain  critical  value  of  the  length  size  /sc.  On  the 

background  of  smaller  defects,  a  new  cascade  appears  with  its  own 
evolution  laws  and  new  self-similar  stage. 
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Figure  5.  Pore  size  distributions  in  natural  (a)  [19],  and  universal  (b) 
coordinates  for  crystalline  iron  specimens  (0.006%C)  tested  in  creep 
under  9.3  MPa  tensile  stress  and  700°C  temperature.  1:  €=2.1%,  t=23 
hours;  2:  e=6.2%.  t=142  hours,  3;  e=9.3%,  t=262  hours.  Ng^:  maximum 

frequency,  ^g^:  corresponding  pore  size  {after  (13)). 


Figure  6:  Defect  size  distribution  for  various  materials  under  various  types 
of  loading.  1  -  polycrystailine  copper,  fatigue;  2  -  brass,  static  tension; 
3  -  iron,  creep;  4  -  steel  347,  creep;  5  -  steel  304,  creep;  6  -  rocks  (after 
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III.  For  a  symmetric  isolated  crack  of  the  length  l,  the  condition  for 
the  critical  value  of  the  stress  intensity  factor  K  at  the  tip  of  the  crack 

K=K0/'Jn/2  (3.10) 

gives,  as  is  well  known,  a  finite  relation  between  the  crack  length  and 
applied  load.  If  there  are  multiple  cracks  in  a  body,  then  a  set  of  2n  finite 
equations  (n  -the  number  of  cracks)  can  be  obtained  by  application  of 
condition  (3.10)  at  each  crack  tip.  This  set,  however,  becomes  non¬ 
constructive  with  growing  n.  while  for  a  rather  small  number  like 
n=  10-20  obtaining  the  solution  becomes  impossible  even  using  modem 
computers.  The  statistical  self- similarity  of  the  cascade  of  defects  returns 
us  to  the  simplest  case  analogous  to  that  of  an  isolated  symmetric  crack. 
Indeed,  the  finite  relation  exists  between  the  average  crack  size  in  the 
cascade  *sc  and  the  applied  load.  Moreover,  the  principal  possibility 
appears  to  be  to  estimate  the  loading  capacity  as  well  as  the  life-time  of  a 
structure  using  the  parameter  *sc  and  material  properties. 

The  Paris-Erdogan  scaling  law  (3.5),  considered  above  in  some  detail, 
also  follows  ultimately  from  the  self- similarity  of  the  cascade  of  defects  in 
the  vicinity  of  the  crack  head.  Moreover,  from  the  self-similarity 
hypothesis  of  the  cascade  of  defects,  the  Coffin-Manson  equation  for  low- 
cycle  fatigue,  which  is  well  known  to  experimentalists, 

nj{Ae](Ae)a  =  Const  (3.11) 

can  be  obtained  by  a  simple  analysis  (12).  Here  njfAel  is  the  number  of 
cycles  to  failure  at  a  given  strain  amplitude  Ae,  a  -  a  material  constant.lt  is 
interesting  that  the  well-known  Palmer-Minor  damage  accumulation  rule 
at  variable  strain  amplitudes: 

n 

J  dn/nj{Ae(n)]=l  (3.12) 

o 

also  follows  rigorously  from  the  self-similarity  hypothesis  formulated 
above. 

It  is  also  noted  that  the  principle  of  statistical  self-similarity  of  the 
cascade  of  defects  could  be  a  distinctive  bridge  between 
phenomenological  strength  studies  and  the  microscopic  studies  of 
dislocations,  vacancies,  etc.  which  as  yet  have  no  constructive  outcome  to 
practical  strength  analysis.  Remember  again  the  analogy  with  turbulence. 
In  the  theory  of  turbulence,  the  methods  of  practical  computation  of 
turbulent  flows  are  always  based  on  the  idea  of  the  vortex  cascade  self¬ 
similarity  (including  various  subgrid  simulations  in  numerical  models) 
which  allows  one  to  reduce  the  calculations  to  solving  a  rather  reasonable 
system  of  the  equations  for  turbulent  energy  density,  average  vortex  size 
(or  some  related  quantity),  etc,  A  similar  approach  seems  to  be  possible, 
in  principle,  in  multiple  fracture  studies. 
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4.  DAMAGE  ACCUMULATION 

The  study  of  damage  accumulation  in  a  cylindrical  specimen  under 
tension,  influenced  by  microstructural  effects,  is  apparently  one  of  the 
characteristic  examples  of  the  micromechanical  approach  to  fracture 
studies.  Here  we  will  discuss  a  tentative  mathematical  model  of  this 
phenomenon. 

The  concept  of  damage  was  introduced  to  fracture  mechanics  by  L.M. 
Kachanov  (19,201,  F.K.G.  Odqvist  and  J.  Hult  [21,221.  According  to  this 
approach  the  damage  factor  w  is  introduced,  or,  equivalently,  the 
continuity  factor  \|/=l-o).  In  the  virgin  material  co=0,  vy=l  and  fracture 
corresponds  to  co=l,  y=0. 

A  physical  interpretation  of  damage  at  a  point  is  the  properly  averaged 
relative  amount  of  broken  bonds  between  microstructural  elements  of  the 
body.  Under  the  natural  assumption  that  every  broken  bond  excludes  a 
fixed  volume  of  the  material  from  carrying  capacity,  the  simplest 
interpretation  of  damage  is  obtained  as: 


o)  =  (S-Sr)/S  (4.1) 

where  S  is  the  specimen  cross-section  area  and  Sr  is  the  area  of  the  part 
of  the  cross-section  where  the  elements  are  able  to  carry  the  tensile  load. 
A  kinetic  law  of  damage  accumulation  is  assumed  according  to  [19-221 
which  can  be  represented  in  the  form 

dtco  =  (1/t)  q(<o,o,T)  (4.2) 

Here  t  is  the  time,  x  -  a  certain  constant  characteristic  time  of  the 
damage  accumulation  process,  o  -  the  actual  stress  in  the  load-carrying 
part  of  the  specimen  cross-section  which  is  related  to  the  bulk  tensile 
stress  o0  and  damage  w  for  the  case  of  the  simplest  damage 

interpretation  (4.1)  by  an  equation, 

o  =  o0/(l-co),  (4.3) 

and  T  is  the  temperature.  The  dimensionless  function  q  is  assumed  to  be 
specified  for  the  given  material.  We  emphasize  that  stress  and 
temperature  dependence  of  damage  accumulation  rate  is  much  stronger 
than  the  direct  dependence  on  the  current  damage.  As  an  example,  the 
often  used  Arrhenius-type  kinetic  law  with  stress-dependent  activation 
energy  can  be  considered, 

q=(l-<o)pexp[-U-ya)/kTJ  (4.4) 

Here  k  is  the  Boltzmann  constant,  U  -  the  zero-stress  activation  energy,  p 
-  a  constant,  "reaction  order",  y  -  the  other  kinetic  constant.  The 
quantities  U/kT,  yc0/kT  are  large,  of  the  order  of  ten  or  even  more,  so 

the  stress  and  temperature  dependence  of  the  damage  accumulation  rate 
is  strong.  Power-type  kinetic  laws  are  also  in  use. 
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q=(  1  -£o)P(cr/cr0)n  (4.5) 

where  p,  n  are  dimensionless  constants  and  n  is  assumed  to  be  large. 

For  fixed  constant  temperature,  the  relation  (4.2)  can  be  considered 
as  an  ordinary  differential  equation  for  the  damage  to  as  a  function  of  time. 
An  important  point  is  that  its  solution  exists  only  for  a  finite  time.  When 
the  value  <o  becomes  equal  to  one,  the  solution  of  the  ordinary  equation 
(4.2)  ceases  to  exist  which  defines  the  specimen  life-time,  tQ.  It  is  noted 
that  due  to  initial  damage  inhomogeneity,  the  current  damage 
distribution  can  be  nonuniform  over  the  specimen  length.  According  to 
the  approach  formulated  above,  damage  accumulation  would  proceed 
independently  in  each  cross-section  over  the  specimen  length.  Thus 
rupture  and  life-time  of  a  specimen  will  be  determined  by  the  ultimate 
damage  value  o>=l  being  reached  in  the  initially  most  damaged  cross- 
section  without  any  influence  from  the  neighbouring  cross-sections. 

We  will  see  that  the  influence  of  the  material  microstructure  can 
drastically  change  the  statement  of  this  problem.  Namely,  a  specific  non¬ 
linear  diffusion-type  mechanism  for  damage  transfer  appears.  Therefore, 
the  ordinary  differential  equation  (4.2)  is  replaced  by  a  non-linear 
parabolic  partial  differential  equation  with  a  diffusion  coefficient  that  is 
strongly  stress  dependent. 

Indeed,  real  materials:  metallic  alloys,  ceramics,  composites,  rocks, 
etc.  always  possess  a  particular  microstructure.  In  polycrystalline  metals, 
the  microstructure  is  formed  by  the  grains  and  intercrystalline  matter, 
while  a  supramolecular  structure  of  various  orders  exists  in  polymers,  etc. 
This  microstructure  is  microinhomogeneous;  we  have,  so-to-say, 
"microstiffeners"  and  "microweakeners"  distributed  over  the  material. 
Damage  accumulation  consists  in  breaking  bonds  between  the  elements 
of  microstructure  and/or  inside  these  elements  and  subsequent 
formation  and  coalescence  of  defects.  An  instructive  example  is  the 
cavitation  and  the  formation  of  micro-cracks  at  the  grain  boundary  facets 
studied  by  V.  Tvergaard  (23,24).  Due  to  microinhomogeneity  of  the 
microstructure  (different  grain  sizes,  and  grain  properties),  this  damage 
accumulation  process  becomes  microinhomogeneous.  Moreover,  the 
breakthrough  of  cracks  at  the  grain  boundary  facets  increases  the  local 
stresses  arising  under  the  action  of  the  bulk  tensile  stress.  At  the  same 
time  there  is  a  reduction  of  the  local  damage  accumulation  rate  in 
stronger  places  (a  compressive  stress  due  to  local  "microstiffening"). 
Thus,  in  addition  to  the  acting  bulk  tensile  stress,  a  system  of  self- 
balanced  microstresses  appears. Therefore  the  stresses  in  an  idealized 
microscopically  homogeneous  material  differ  from  those  really 
influencing  the  local  damage  accumulation  rate. 

Thus,  consider  a  prismatic  bar  under  static  tension  o0  and  isothermal 
conditions.  Microscopic  damage  distribution  and  microstress  fields  are 
stochastic.  Both  fields  are  considered  as  statistically  homogeneous  over 
the  cross  sections,  although  in  general,  non-homogeneous  over  the  bar 
length  and  time -varying. 

By  averaging  relation  (4.2)  which  is  valid  for  ideally  homogeneous 
conditions  over  the  cross-section,  we  obtain 
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at<G»=(l/-c)((l/S)  J  q((o,o,T)dS)l  ,  (4.6) 

S 

where  the  brackets  <>,  which  we  will  further  omit,  denote  averaging  of 
the  ensemble.  (Due  to  the  ergodicity  hypothesis  which  is  natural  here,  we 
can  replace  averaging  over  the  cross-section  by  ensemble  averaging). 
However,  the  right-hand  side  cannot  be  replaced  by  the  damage 
accumulation  rate  corresponding  to  the  average  stress  a0/(l-co)  as  it 

should  be  in  the  microhomogeneous  case;  there  exists  a  certain 
additional  microstress  field  and  the  stress  dependence  of  the  damage 
accumulation  rate  is  strong. 

To  calculate  the  right-hand  side  of  (4.6),  we  will  use  the  approach 
often  used  after  Khinchin  and  Kolmogorov  in  the  theory  of  stochastic 
processes.  Indeed,  the  right-hand  side  of  (4.6)  will  be  replaced  by  an 
integral  over  the  bar  length  by  introducing  a  certain  weight  function 
p(x-f): 


(1/S)  j  q(©.o,T)dS=  j  q(co,o£),T)p(x-£)d4  (4.7) 


Here  the  weight  function  p(x-£)  determines  the  relative  amount  of 
elements  of  the  cross-section  area  where  the  damage  accumulation  rate 
corresponds  to  the  actual  average  stress  in  a  neighbouring  cross-section 
having  a  certain  displaced  coordinate  this  always  can  be  done.  We  used 
here  essentially  the  fact  that  the  average  damage  co  and  consequently  the 
average  actual  stress  are  the  fields  continuously  varying  over  the  bar 
length  so  that  the  actual  stress  at  a  certain  point  of  the  cross-section, 
increased  or  reduced  by  micro-stresses  or  microinhomogeneity, 
corresponds  to  the  average  stress  in  a  certain  neighbouring  cross-section. 
Furthermore,  under  the  assumption  that  the  damage  accumulation  rate  is 
strongly  stress  dependent,  the  weight  function  p(x-£)  is  rapidly 
decreasing  with  growing  modulus  of  its  argument.  Therefore,  the  limits 
in  the  right-hand  side  integral  of  (4.7)  can  be  taken  to  be  infinite. 

Obviously,  the  weight  function  p(x-^)  satisfies  the  relation 


J  p(x-^)d^=l 


(4.8) 


by  consideration  of  an  ideally  microhomogeneous  case.  For  the  sake  of 
some  simplification,  we  also  assume  that  the  weight  function  is 
symmetric,  so  that 


OO 


j  (x-5)  p(x-4)d^=0  . 


(4.9) 
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Furthermore,  we  introduce  the  microstructural  length-scale  X  by  the 
relation 


X2=(l/2)  |  (x-^)2p(x-^)dx  (4.10) 

~oo 

It  is  noted  that  we  neglected  the  dependence  of  the  weight  function  on 
the  current  damage  for  simplicity  sake  since  otherwise  X  would  be  a 
function  of  the  current  damage. 

Now  we  expand  the  function  q(a),o(^),T)  under  the  integral  sign  in  the 
right-hand  side  of  (4.7)  into  the  Taylor  series  near  the  point  £=x: 


q((o,a,(£),T)  =  q(o),o,(x,t).T)  +  (^-x)0^q)^_x  + 


(x^ 

+  2 


@^q)$=> 


(4.11) 


Neglecting  the  remaining  terms  which  can  be  properly  justified  and 
substituting  the  expansion  (4.11)  to  (4.7),  using  the  relations  (4.8M4.10), 
and  substituting  the  result  in  (4.6).  we  come  to  the  partial  differential 
equation. 


X2 

atco  =  (l/t)q(o>,o(x,t),T)  +  ax{(Y  ^o^xSJ 


(4.12) 


Obviously  damage  under  ordinary  conditions  cannot  heal,  so  that  equation 
(4.12)  is  valid  only  if  its  right-hand  side  is  not  negative;  otherwise  it  is 
replaced  by 

3tco  =  0  (4.13) 


Using  the  relation  (4.3)  between  the  average  actual  stress  and  the 
average  damage,  we  obtain  from  (4.12)  an  evolution  equation  for  the 
damage  or. 


8to>  =(l/x)q(co,a0/(l-co),T)+dx[{^  ooacq)  axw]  (4.14) 

Thus,  instead  of  the  ordinary  differential  equation  (4.2),  a  non-linear 
diffusion-type  equation  is  obtained  for  the  damage  co,  where  the  diffusion 
coefficient  is  strongly  stress  dependent.  The  resultant  mathematical 
problem  is  currently  fashionable  among  the  constellation  of 
mathematicians  concerned  with  blow-up  and  quenching  problems  of  non¬ 
linear  partial  differential  equations  (see  the  recent  publications  (25-28)). 
(Blow-up  and  quenching  correspond  to  fracture;  in  fracture  problems, 
however,  an  additional  non-linearity  related  to  condition  (4.13)  is  added.) 


In  the  simplest  case  of  the  Arrhenius -type  damage  kinetics  of  zero 
order,  equation  (4.14)  assumes  the  dimensionless  form 

2 

8q\) f  =  \y2eV  +  ky2  9^eV  =  I(\y)  (4. 15) 

if  I>0,  and  =0  if  I<0.  Here 
0=(t/t)exp{-U/kT),  y=p/(l-co),  p=ya0/kT 

C=x/L,  k=X2/L2  (4.16) 

where  L  is  the  characteristic  length-size  of  the  initial  damage  distribution 
over  the  bar  length. 

The  initial  condition  takes  the  form 

V(C.O)  =  p/(l-o>0(0)  (4.17) 

where  to0(Q  is  the  initial  damage  distribution  over  the  specimen  length. 
The  boundary  conditions  are  taken  in  the  form  of  vanishing  damage  flux 
at  the  boundaries: 

3^(0,  0)=O,  3^(1.0)=O  (4.18) 

which  corresponds  to  a  periodic  damage  distribution  of  wavelength  L  or 
strongly  reinforced  specimen  grips.  The  solution  should  be  obtained  in 
the  region 

0<C<1,  O<0<0o  (4.19) 

Here  0O  corresponds  to  the  blow-up  (fracture)  time  where  the  solution  y 
at  a  point  becomes  infinite.  The  specimen  life-time  is  determined  by  the 
equation 

t0=eoxeU/kT  (4.20) 

It  is  interesting  to  consider  for  demonstration  purposes  the  results  of 
the  numerical  integration  of  the  problem  (4.15).  (4.17)  (4.18)*, 

presented  in  Figs.  7,8. 

In  Fig.  7a,  the  process  of  damage  accumulation  for  extremely  low 
k=10'5  is  presented.  As  we  see,  no  damage  diffusion- extension  of  the 
damaged  region  is  observed.  In  Fig. 7b  (k=10  3),  the  extension  of  the 
damage  region  is  clearly  observed,  and  it  is  seen  that  damage 

accumulation  in  the  initially  most  damaged  cross-section  commences 
only  after  noticeable  extension  of  the  damaged  region  which  occurs  at  a 
time  which  is  one  order  of  magnitude  larger  than  the  life-time  without 


•Numerical  calculations  were  performed  by  Dr.  V.M.  Prostokishin. 


Figure  7a.  Damage  evolution  at  various  values  of  dimensionless  diffusion 
parameter  k.  |p=yao/kT=10,9^=4.7-10'2  -  dimensionless  life-time  at 

k=O.J;  k=10'5,  (0) -0=0;  (l)-0/0*  =0.55;  (2J-0/0*  =1.05;  (3J-0/0*  =1.29; 


(4)-0/0*  =  1.31. 
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Figure  7b.  Damage  evolution  at  various  values  of  dimensionless  diffusion 
parameter  k.  {p=7ao/kT=lO.0Q=4.7- 10'2  -  dimensionless  life-time  at 

k=0.1;  k=10'3,  (0)-9=0;  ( 1  )-0 / 0^=4.O:  (2)  0/0^=  16.0;  (3)-0/eJ=24.32; 

(4)  -8/0*  =24.33;  (5)-0/0*  =24.34. 
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Figure  7c.  Damage  evolution  at  various  values  of  dimensionless  diffusion 
parameter  k.  (|i=yoo/kT=10,9*  =4.7- 10'2  -  dimensionless  life-time  at 

k=0.j;  k=10'1.  (O)-0=O;  (D-0/0^1.0;  (2)-0/0*  =6.0;  (31-0/8*  =1 1.0; 

(4)-e/9*  =  16.0;  (5)-0/e‘=21.O;  (61-0/0^=28.0; 

(71-6/0^=29.25;  (81-0/0^=29.26;  0/0^=29.27 


Figure  8.  Graph  of  the  function  0^/0  =F(k).  dimensionless  life-time  ratio 

for  positive  k,  and  k=0.  Existence  of  a  critical  value  of  k  separating  two 
regimes  is  seen.  Curve  { l )  corresponds  to  p=l,  |i=yoQ/kT=10;  curve  (2) 
corresponds  to  p=0,  p=lO;  curve  (3}  corresponds  to  g=20.  p=0  and  p=l 
(the  curves  for  both  values  practically  coincide). 


damage  diffusion.  Figure  7c  (k=  1  O' 1 )  is  instructive:  initially,  damage 
uniformly  covers  the  specimen,  and  then  damage  accumulation  proceeds 
uniformly  over  the  specimen.  Note  the  damage  wave  propagation  (curves 
2-4).  Figure  8  demonstrates  the  increase  of  the  specimen  life-time  with 
increasing  dimensionless  diffusion  factor  k. 

It  is  plausible  that  for  large  stress  dependence  of  the  accumulation 
rate  there  exists  a  critical  value  of  the  diffusion  parameter  kcr.  For  k<kcr, 
damage  accumulation  proceeds  non-uniformly  and  rupture  occurs  in  the 
cross-section  that  has  the  largest  initial  damage.  For  k>kcr  damage 
accumulation  proceeds  in  the  following  way:  initially  damage  becomes 
uniformly  distributed  over  the  specimen  and  this  process  takes  the  major 
part  of  the  specimen  life-time.  Upon  reaching  uniform  damage 
distribution,  damage  accumulation  continues  uniformly  over  the 
specimen  length  subsequently  leading  to  fracture. 

The  tentative  model  of  damage  accumulation  was  considered  here 
mainly  to  demonstrate  the  approach  and  to  discuss  an  apparently  new 
phenomenon  -  stress- influenced  damage  diffusion.  I  would  like  to  repeat 
here  a  sentence  from  a  very  comprehensive  recent  review  by  A. 
Needleman  and  V.  Tvergaard  [29]  devoted  to  plastic  flow  localization  in 
metals:  "Now  everybody  loves  a  localization  problem!"  Localization  of 
plastic  deformation  into  shear  bands  in  ductile  metals  is,  indeed,  a 
fascinating  problem.The  remarkable  plenary  lecture  by  J.  Rice  [30]  at  XIV 
ICTAM,  the  book  of  J.  Hutchinson  {311.  the  papers  of  the  author  of  this 
review  as  well  as  those  of  other  colleagues  referenced  here  form  really 
fascinating  contributions.  The  approach,  however,  is  based  entirely  on  the 
constitutive  equations.  In  principle,  this  problem  is  close  to  the  problem 
of  damage  accumulation,  and  I  sincerely  hope  that  the  love  of  localization 
problems  will  materialize  also  in  considering  them  by  the  approach 
outlined  above. 

Ladies  and  Gentlemen,  I  come  to  the  conclusion.  You  could  have  heard 
in  recent  years  the  opinion  that  fracture  mechanics,  while  being  so 
fashionable  in  former  times,  is  now  exhausted,  that  there  is  a  crisis  in 
fracture  mechanics,  that  it  is  unable  to  answer  important  questions  of 
engineers,  etc. 

I  do  not  think  so.  Like  a  beautiful  lady  in  her  Balzac  age,  fracture 
mechanics  has  her  past  deserving  to  be  remembered  and  her  promising 
future.  She  attracts  both  young  and  mature  people,  and  of  course  has 
problems  of  development.  I  think  that  the  micromechanical  approach 
which  is  now  in  development  by  many  research  groups  throughout  the 
world  will  help  to  answer  the  existing  questions  and  will  propose  new 
fascinating  challenges.  Moreover,  there  is  in  this  field  some  flavor  of  close 
discoveries,  -  discoveries  important  not  only  for  science  and  technology 
but  for  everyday  life  of  everybody. 
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Material  Instabilities  and  phase  transitions  in  thermoelasticity 
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Abstract 

This  paper  reviews  recent  joint  work  with  J.  K.  Knowles  on  the  propagation  of  phase 
boundaries  in  a  two-phase  thermoelastic  solid.  We  address  some  general  thermo- mechanical 
issues,  present  an  explicit  model,  compare  the  predictions  of  the  model  with  experimen¬ 
tal  observations,  consider  the  influence  of  inertia,  and  describe  the  role  of  viscous  and 
strain-gradient  effects.  This  paper  is  based  on  the  work  described  in  [2, 3, 4, 7], 

1.  INTRODUCTION 

Certain  alloys  can  exist  in  more  than  one  solid  phase,  each  phase  being  associated  with 
a  distinct  crystal  structure.  Typically,  one  phase  is  preferred  under  certain  conditions 
of  stress  and  temperature,  while  another  is  favored  under  different  conditions.  As  the 
stress  or  temperature  vary,  the  material  may  therefore  transform  from  one  phase  to 
another,  leading  to  a  discontinuous  change  in  the  properties  of  the  body.  Examples  of 
such  materials  include  the  shape-memory  alloy  NiTi,  the  ferroelectric  alloy  BaTi03, 
the  ferromagnetic  alloy  FeNi  and  the  high-temperature  superconducting  ceramic  alloy 
ErRtuB 4.  In  each  of  these  examples,  the  transition  from  one  phase  to  another  occurs 
without  diffusion  and  one  speaks  of  the  transformation  as  being  displacive. 

Consider  for  instance  the  class  of  Cu-Al-Ni  alloys  described  by  Otsuka  and  Shimizu 
[36].  These  alloys  exist  in  a  cubic  phase  (austenite)  and  an  orthorhombic  phase  (marten¬ 
site).  Under  stress-free  conditions,  the  cubic  phase  is  stable  at  temperatures  above  the 
transformation  temperature  dj,  the  orthorhombic  phase  is  stable  below  it.  When  a 
stress-free  specimen  of  austenite  is  cooled  from  above  the  transformation  temperature, 
the  martensite  phase  is  nucleated  at  the  “martensite  start  temperature”  M,  (<  $t)-  The 
specimen  now  consists  of  a  mixture  of  both  austenite  and  martensite,  and  the  phases  are 
separated  from  each  other  by  one  or  more  interfaces  -  phase  boundaries ;  one  speaks  of  the 
phase  boundaries  as  being  coherent,  implying  that  the  deformation  is  continuous  across 
them  even  though  the  deformation  gradient  is  not.  The  phase  boundaries  that  have  nu¬ 
cleated  now  propagate,  transforming  particles  from  austenite  to  martensite  as  they  pass 
through,  eventually  converting  the  entire  specimen  into  martensite.  If  the  specimen  is 
now  heated,  the  whole  process  is  reversed,  with  the  martensite  to  austenite  transforma¬ 
tion  being  initiated  at  the  “austenite  start  temperature”  A,  (>  0r).  T  his  is  a  reversible 
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or  thermoelastic  phase  transition.  The  values  of  the  transformation  temperature  Or  and 
the  nucleation  temperatures  M„  and  As  depend  critically  on  the  alloy  composition  and 
the  heat  treatment;  they  also  vary  with  the  application  of  stress,  e.g.  see  Figure  10  in 
[36]. 

In  the  case  of  Cu-Al-Ni,  there  are  in  fact  a  number  of  different  types  of  martensite, 
distinguished  from  each  other  by  a  difference  in  stacking  order,  see  Figure  15  in  [36], 
Consider  for  example  the  two  martensites  7J  and  (3[.  The  transformation  from  austenite 
— >  7!  martensite  occurs  under  either  stress-free  conditions  or  at  small  values  of  applied 
stress,  while  the  austenite  —►  0\  martensite  transformation  occurs  only  at  sufficiently 
large  values  of  stress.  More  importantly,  a  phase  boundary  in  the  former  case  typically 
propagates  slowly ,  while  a  phase  boundary  between  austenite  and  /3\  martensite  propa¬ 
gates  rapidly  at  speeds  which  are  of  the  order  of  the  shear  wave  speed.  Thus  both  slow 
and  fast-moving  phase  boundaries  occur,  and  both  are  of  interest;  see  also  Grujicic  et  al. 
[22]  and  Nishiyama  [34]. 

Various  continuum-level  issues  related  to  reversible  phase  transformations  in  crys¬ 
talline  solids  have  been  successfully  studied  using  the  theory  of  finite  thermoelasticity, 
e.g.  Ericksen  [17,18],  James  [25]  and  Pitteri  [39].  For  a  thermoelastic  material,  the 
Helmholtz  free-energy  function  ip  depends  on  the  deformation  gradient  tensor  F  and  the 
temperature  0:  ip  =  i/>(F,0);  since  the  occurrence  of  diffusion  is  not  envisaged  here,  it  is 
not  necessary  to  explicitly  account  for  the  dependency  on  alloy  composition.  If  the  stress- 
free  material  can  exist  in  two  phases,  then  the  energy  function  ip  must  have  two  disjoint 
energy-wells,  each  well  corresponding  to  one  phase.  At  6  —  0?,  the  two  minima  have  the 
same  height;  for  0  >  6j  the  austenite  minimum  is  smaller,  while  for  0  <  0j,  the  marten¬ 
site  minimum  is  smaller.  In  the  presence  of  stress  S,  the  situation  may  be  described  in 
similar  terms  except  that  now  one  must  consider  the  energy-wells  of  the  potential  energy 
function  G(F;S,0)  where  S  denotes  the  first  Piola-Kirchhoff  stress  tensor. 

Much  recent  activity  in  continuum  mechanical  studies  on  thermoelastic  phase  tran¬ 
sitions  has  been  focussed  on  two  basic  issues:  the  first  concerns  energy  minimizing  de¬ 
formations  corresponding  to  the  stable  configurations  of  a  body;  the  second  is  related 
to  the  non-equilibrium  evolution  of  a  body  towards  such  stable  configurations  through 
intermediate  states  of  metastability. 

Ericksen  [16]  studied  energy  minimizing  deformations  for  a  two-phase  material  within 
the  one-dimensiona!  mechanical  setting  of  a  tensile  bar.  He  showed,  in  particular,  that  for 
certain  values  of  prescribed  displacement,  the  stable  equilibrium  configurations  of  the  bar 
involve  co-existent  phases.  The  analogous  issue  in  higher  dimensions  is  more  complicated: 
typically,  deformation  gradient  tensors  Fa  and  F„,  corresponding  to  the  austenite  and 
martensite  energy  minima  are  not  kinematically  compatible  with  each  other,  i.e.  F„  —  Fm 
is  not  a  rank-one  tensor.  Therefore  an  energy  minimizing  deformation  cannot  correspond 
to  homogeneously  deformed  states  of  austenite  and  martensite  separated  by  a  phase 
boundary.  In  fact,  an  energy  minimizer  usually  does  not  even  exist,  and  one  must 
contend  instead  with  minimizing  sequences.  The  deformation  pattern  associated  with 


such  a  sequence  characterizes  the  underlying  microstructure  of  the  material;  in  Cu-Al-Ni 
for  example,  an  austenite- martensite  phase  boundary  separates  a  homogeneous  state  of 
austenite  from  a  fine  mixture  of  twinned  martensite.  These  ideas  were  put  forward  by 
Khachaturyan  and  Shatalov  [27]  and  Roitburd  [41]  using  a  geometrically  linear  theory, 
and  by  Ball  and  James  [8]  for  the  finite  deformation  theory;  see  Bhattacharya  [10]  for 
a  comparison  of  these  two  theories.  Ball  and  James  [8]  studied  an  austenite/twinned 
martensite  interface  in  detail,  and  showed  that  the  consequences  of  their  theory  are  in 
agreement  with  the  crystallographic  theory  of  martensite  which  is  a  classical  theory  that 
is  not  based  on  the  idea  of  energy  minimization.  Needle-like  microstructures  and  self- 
accommodating  microstructures  have  been  explored  by  Bhattacharya  [9]  using  similar 
ideas.  The  fineness  of  the  microstructure  is  controlled  by  surface  energy,  though  the 
effects  of  surface  energy  may  be  more  subtle  than  this;  e.g.  see  Gurtin  and  Struthers  [21] 
and  Kohn  and  Muller  [29]. 

The  usual  continuum  theory  of  thermoelasticity,though  adequate  for  characterizing 
two-phase  energy  minimizers,  does  not,  by  itself,  characterize  quasi-static  or  dynamic 
processes  of  a  body  involving  transitions  from  one  phase  to  another.  This  is  illustrated  in 
a  striking  manner  by  the  tremendous  lack  of  uniqueness  of  solution  to  particular  initial¬ 
boundary- value  problems  formulated  on  the  basis  of  the  usual  theory;  see  Abeyaratne 
and  Knowles  [1,3]. 

Quasi-static  or  dynamic  processes  generally  involve  states  that  are  merely  metastable 
and  so  fall  under  the  category  of  “non-equilibrium  thermodynamic  processes.”  Consider¬ 
ations  pertaining  to  the  rate  of  entropy  production  during  such  a  process  leads  naturally 
to  the  notion  of  the  driving  force  (or  Eshelby  force)  f  acting  on  a  phase  boundary,  Abe¬ 
yaratne  and  Knowles  [2]  and  Heidug  and  Lehner  [24];  see  also  Eshelby  [19],  Knowles  [28], 
Rice  [40].  The  theory  of  non-equilibrium  processes  can  then  be  used  to  argue  for  the 
need  for  a  constitutive  equation  -  a  kinetic  law  -  relating  the  propagation  speed  Y'n  of 
the  phase  boundary  to  the  driving  force  /  and  temperature  9:  V„  —  V(f,0).  The  kinetic 
law  controls  the  rate  of  progress  of  the  phase  transition,  as  has  long  been  recognized  in 
the  materials  science  literature,  see  e.g.  Christian  [13];  in  fact,  some,  though  not  all, 
micro-mechanical  models  of  kinetics  lead  to  kinetic  laws  of  the  form  Vn  =  V{f,0). 

The  kinetic  relation  controls  the  progress  of  the  phase  transition  once  it  has  com¬ 
menced.  A  separate  nucleation  criterion  is  required  to  signal  the  init  iation  of  a  transition. 
(This  is  analogous  to  the  roles  played  by  a  flow  rule  and  a  yield  criterion  in  continuum 
plasticity  theory.)  Explicit  models  of  nucleation  are  more  complicated  to  establish.  A 
general  discussion  of  nucleation  theory  in  phase  transitions,  from  a  materials  science 
point  of  view,  may  be  found  in  Christian  [13]. 

Thus  a  complete  constitutive  theory  which  is  capable  of  modeling  processes  involving 
thermoelastic  phase  transitions  consists  of  three  ingredients:  a  Helmholtz  free- energy 
function,  a  kinetic  relation,  and  a  nucleation  criterion.  The  particular  initial  boundary- 
value  problems  referred  to  previously,  which  were  ill-posed  in  the  usual  setting,  were 
shown  in  Abeyaratne  and  Knowles  [1,3]  to  be  well  posed  and  to  have  unique  solutions 
within  this  broadened  setting. 
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A  number  of  studies  have  been  concerned  with  developing  explicit  constitutive  mod¬ 
els.  For  example,  Ericksen  [18]  and  Silling  [45]  have  constructed  three-dimensional 
Helmholtz  free-energy  functions  for  modeling  certain  crystals;  Falk  [20]  has  studied  a 
one-dimensional  polynomial  free-energy  function,  see  also  Jiang  [26].  Models  of  kinetic 
relations  have  been  constructed,  for  example,  by  Muller  and  Wilmansky  [32]  using  certain 
statistical  considerations,  and  by  Otsuka  et  a!.,  [35]  by  assuming  phase-boundary  motion 
to  be  similar  to  dislocation  motion. 

In  [7],  we  constructed  a  complete,  explicit  one-dimensional  model  for  describing  ther¬ 
moelastic  phase  transformations.  In  constructing  our  model,  we  were  motivated  by  a 
desire  to  maintain  great  simplicity,  and  to  construct  an  analytically  amenable  model 
that  could  be  used  to  gain  helpful  insights,  especially  for  fast  transformations,  before  un¬ 
dertaking  computations  based  on  more  realistic  assumptions.  Our  Helmholtz  free-energy 
function  has  two  energy-wells,  corresponding  to  an  austenitic  phase  and  a  single  variant 
of  martensite.  Both  phases  are  modeled  as  having  the  same  constant  elastic  moduli, 
coefficients  of  thermal  expansion  and  specific  heats  at  constant  strain;  the  phases  are 
distinguished  from  each  other  by  the  presence  of  a  transformation  strain  and  a  latent 
heat.  We  also  constructed  an  explicit  example  of  a  kinetic  relation  based  on  the  notion 
of  thermal  activation.  Finally,  the  nucleation  criterion  considered  assumes  nucleation 
to  occur  at  critical  values  of  driving  force,  one  value  signalling  the  initiation  of  the 
austenite  —*  martensite  transition,  and  another,  the  reverse  transition;  this  nucleation 
criterion  can  be  stated  equivalently  .in  terms  of  temperature-dependent  stress-levels  (or 
stress-dependent  temperature-levels).  The  quasi-static  response  of  this  model  to  vari¬ 
ous  thermo-mechanical  loadings  was  also  studied  in  [7j  and  the  results  were  compared 
qualitatively  with  some  experimental  observations  as  described  in  Section  4  below. 

There  is  a  considerable  mathematical  literature  concerned  with  the  issue  of  ident  ifying 
meaningful  solutions  to  a  system  of  quasi-linear  partial  differential  equations;  e.g.  see  the 
review  article  by  Dafermos  [15].  In  the  one-dimensional  purely  mechanical  setting  of  a 
tensile  bar,  the  differential  equations  of  interest  stem  from  linear  momentum  balance  and 
kinematic  compatibility.  A  typical  motion  of  the  bar  involves  moving  strain  discontinu¬ 
ities  corresponding  to  shock  waves,  and  possibly  phase  boundaries,  and  each  discontinuity 
is  subject  to  a  pair  of  jump  conditions  arising  from  the  aforementioned  physical  consid¬ 
erations,  as  well  as  to  the  entropy  inequality  fVn  >  0  arising  from  the  second  law  of 
thermodynamics.  If  the  stress-strain  relation  is  rnonotonically  increasing,  and  in  addi¬ 
tion  is  either  strictly  convex  or  strictly  concave,  phase  transformations  cannot  occur,  and 
all  strain  discontinuities  are  shock  waves;  as  in  gas  dynamics,  the  entropy  inequality  then 
serves  to  single  out  unique  solutions  to  the  Cauchy  problem.  The  stress-strain  curve 
associated  with  a  two-well  energy  function  is  not  monotonic  (it  consists  of  two  rising 
branches  that  are  connected  by  an  intermediate  declining  branch),  and  the  entropy  in¬ 
equality  is  no  longer  sufficient  to  deliver  uniqueness.  The  standard  approach  to  resolving 
this  difficulty  has  been  to  replace  the  entropy  inequality  by  a  stronger  “solution-selecting 
mechanism”  or  admissibility  criterion.  Among  these,  the  most  common  in  the  setting  of 
phase  transformations  are  perhaps  the  viscosity- strain  gradient  criterion  (Shearer  [43], 
Slemrod  [46],  Truskinovsky  [49]),  the  entropy-rate  shock  criterion  (Dafermos  [15],  Hat- 
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tori  [23]),  and  the  chord  criterion  (Shearer  [42],  Pego  [37]).  An  alternative  approach  for 
attempting  to  achieving  uniqueness  is  to  supplement  the  entropy  inequality  (which  must 
hold  at  all  discontinuities)  with  a  kinetic  relation  (at  subsonic  phase  boundaries)  and 
a  nucleation  criterion.  We  have  shown  in  [3]  that  within  the  context  of  the  Riemann 
problem  for  a  special  material,  the  lack  of  uniqueness  remaining  after  imposition  of  the 
entropy  inequality  is  precisely  that  needed  to  accommodate  these  supplementary  ingredi¬ 
ents;  see  also  LeFloch  [31].  In  addition,  we  have  also  shown  that  the  three  aforementioned 
admissibility  criteria  correspond  precisely  to  three  particular  kinetic  relations  of  the  form 
Vn  =  V(f),  [4,5,6]. 

The  solution  to  most  multi-dimensional  problems  involving  phase  transitions  must  be 
obtained  numerically.  Collins  and  Luskin  [14]  and  Silling  [44]  have  studied  energy  mini- 
mizers  using,  respectively,  finite  element  and  finite  difference  based  methods.  Molecular 
dynamics  simulations  have  also  been  carried  out,  e.g.  Yu  and  Clapp  [50]. 

The  present  paper  reviews  some  of  the  aforementioned  results  concerning  phase 
boundary  propagation.  In  Section  2  we  describe  a  general  three-dimensional  thermo¬ 
mechanical  framework  which  includes  the  effects  of  inertia;  this  leads  naturally  to  the 
thermodynamic  notions  of  driving  force  acting  on  a  phase  boundary  and  a  kinetic  law- 
controlling  its  propagation.  We  also  describe  some  characteristics  of  the  potential  energy 
function  associated  with  a  two-phase  material.  From  there  on  we  restrict  attention  to  a 
one-dimensional  setting:  In  Section  3  we  describe  an  explicit  constitutive  model.  Then 
in  Section  4  we  describe  the  quasi-static  response  of  this  model  in  a  number  of  thermo¬ 
mechanical  loading  programs,  including  one  that  displays  the  shape-memory  effect.  The 
predictions  of  the  model  are  compared  qualitatively  with  experimental  observations.  In 
Section  5  we  consider  the  effects  of  inertia  in  the  absence  of  thermal  influences,  and  elu¬ 
cidate  the  precise  roles  of  the  entropy  inequality,  the  kinetic  relation,  and  the  nucleation 
criterion  in  the  context  of  a  Riemann  problem.  Finally,  in  Section  6  we  consider  the 
effects  of  viscosity  and  strain-gradients;  by  considering  a  phase  boundary  to  be  a  suitable 
limit  of  a  travelling  wave,  we  show  that  the  presence  of  these  effects  leads,  in  a  natural 
way,  to  a  specific  kinetic  relation.  A  more  detailed  discussion  of  the  material  in  the  first 
part  of  Section  2,  and  all  of  Sections  3  and  4,  may  be  found  in  [2,7];  other  than  for  a 
slight  difference  in  the  underlying  constitutive  model,  Sections  5  and  6  are  based  on  [3,4]. 

2.  SOME  GENERAL  CONSIDERATIONS 

Consider  a  body  which  occupies  a  region  71  in  a  reference  configuration  and  consider  a 
motion  of  this  body  on  some  time  interval.  Let  St  be  a  propagating  surface  in  71  across 
which  the  deformation  is  continuous  but  the  deformation  gradient  tensor  F  suffers  a 
jump  discontinuity;  suppose  that  F  varies  smoothly  on  either  side  of  St ■  In  the  present 
setting,  such  a  surface  may  represent  either  a  shock  wave  or  an  interface  that  separates 
two  distinct  phases  of  the  material.  Note  that  St  is  a  non-material  surface,  i.e.  different 
particles  of  the  body  are  located  on  St  at  different  times. 

Let  n  denote  a  unit  normal  vector  and  Vn  the  normal  velocity  of  a  point  x  on  St-  We 
shall  speak  of  the  side  into  which  n  points  as  the  positive  side  of  St-  If  p(x,  t)  is  a  generic 
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field  quantity  that  varies  smoothly  away  from  St  and  suffers  a  jump  discontinuity  across 
St,  we  let  P{x,t)  and  P(x,  t)  denote  the  limiting  values  of  p  at  a  point  x  on  St  and  write 
[[P(M)1]  =  P(x,t)  -P(x,t). 

Propagation:  We  now  list  the  various  jump  conditions  that  must  hold  on  <S(,  using 
a  Lagrangian  description  throughout.  Continuity  of  the  deformation  across  St  implies 
that 


[Ml  +  V„[{F]]n  =  0  on  St,  (2.1) 

where  v(x,  t)  denotes  particle  velocity.  If  p(x)  is  the  mass  density  of  the  body  in  the 
reference  configuration  and  S(x,  t)  is  the  first  Piola-Kirchhoff  stress  tensor,  the  balance 
of  linear  momentum  leads  to 

[[Sn]]  +  [[pv]]V„  =  0  on  St.  (2.2) 

Next,  let  q(x,t)  represent  the  nominal  heat  flux  vector  and  e(x,t)  the  internal  energy 
per  unit  mass;  then  the  first  law  of  thermodynamics  requires  that 

[[Sn  ■  v]]  +  [[pe  +  p\  •  v/2]]V/„  +  [[q  •  n]]  =  0  on  St.  (2.3) 

Finally,  the  second  law  of  thermodynamics  leads  to 

[\pvWn  +  [[q  •  n]]/0  <  0  on  St,  (2.4) 

where  0  is  absolute  temperature  and  77  is  entropy  per  unit  mass.  The  various  physical 
quantities  just  introduced  are  assumed  to  vary  smoothly  away  from  St,  and  with  the 
exception  of  #,  are  permitted  to  suffer  jump  discontinuities  across  St-  The  temperature 
0  is  required  to  be  continuous.  (In  the  event  that  heat  conduction  is  neglected  and  the 
process  is  modeled  as  being  adiabatic,  one  must  relinquish  this  assumption  on  temper¬ 
ature  and  permit  0  to  jump  across  St,  as  in  the  classical  theory  of  shock  waves  in  gas 
dynamics.) 

The  jump  conditions  (2.1  )-(2.4)  impose  restrictions  on  the  field  quantities  on  the  two 
sides  of  the  surface  of  discontinuity  St.  When  this  surface  corresponds  to  a  shock  wave, 
there  are  no  further  conditions  to  be  imposed  on  St-  On  the  other  hand  if  St  describes  a 
phase  boundary,  particles  transform  from  one  phase  to  another  as  they  cross  this  moving 
surface  and  it  is  necessary  to  account  for  the  process  (i.e.  kinetics)  of  this  transition  in 
a  suitable  manner;  in  order  to  do  this  within  the  continuum  theory,  one  must  supply 
further  information  on  St,  in  addition  to  (2.1)-(2.4).  The  need  for  such  supplementary 
information  can  be  made  clear  by  considering  the  physics  of  phase  transitions  as  described 


59 


in  the  materiais  science  literature,  e.g.  [13];  or,  by  solving  particular  initial-boundary- 
value  problems,  and  thereby  demonstrating  the  tremendous  lack  of  uniqueness  of  solution 
that  exists  in  the  absence  of  such  information,  [1,3];  or,  by  the  constitutive  formalism 
associated  with  non-equilibrium  thermodynamic  processes  as  outlined  below. 

Suppose  for  the  remainder  of  this  section  that  St  represents  a  phase  boundary.  Con¬ 
sider  a  sub-region  T>  of  TZ  and  let  T(t;  V)  be  the  rate  of  entropy  production  at  time  t 
associated  with  the  particles  in  V.  By  using  the  basic  field  equations  and  jump  condi¬ 
tions,  one  can  express  T  in  the  form 


r«;  V)  =  /  ffjv  +  /  5 d£!Edv  +  /  !£ja. 

JV  6  Jv  Q2  Js,nV  0 


q  •  GradB 


}Vn 


(2.5) 


Here  8(x,t)  is  the  internal  dissipation  defined  by  8 
/(x,  <),  defined  by 


Or)  S  ■  F / p  —  i  (see  [47]),  and 


/  =  Ml  -  1(S  +  §)  |[F]] 


on  St, 


(2.6) 


is  known  as  the  driving  force  acting  on  the  surface  St-  In  writing  (2.6)  we  have  introduced 
the  Helmholtz  free-energy  per  unit  mass  ip  =  e  —  t]0.  In  the  case  of  quasi-static  processes, 
one  can  provide  a  purely  mechanical  interpretation  for  / n  as  a  force  per  unit  area  exerted 
pointwise  on  St  by  the  body;  this  suggests  the  alternative  name  driving  traction  for  /. 

The  representation  (2.5)  for  T  decomposes  the  total  rate  of  entropy  production  into 
the  sum  of  three  parts:  the  first  arises  from  local  dissipation  in  the  bulk  material  away 
from  St,  the  second  is  the  contribution  due  to  heat  conduction,  and  the  third  represents 
the  entropy  production  rate  arising  from  the  moving  phase  boundary  St-  Suppose  from 
hereon  that  the  material  is  thermoelastic ,  i.e.  that  it  is  characterized  by  a  Helmholtz 
free-energy  potential  ip(F,0)  such  that 


ip(x,t)  =  0(F(x,O,0{x,O),  S  —  pipy ,  V  =  -i'e- 


(2.7) 


Then  the  internal  dissipation  8 ,  and  therefore  the  associated  entropy  production  rate, 
both  vanish.  If  the  body  is  in  thermal  equilibrium,  0(x,t)  =  constant ,  the  entropy 
production  rate  due  to  heat  conduction  vanishes;  if  the  body  is  in  phase  equilibrium, 
by  which  we  mean  the  driving  force  /(x,  t)  vanishes,  the  entropy  production  rate  due  to 
the  moving  phase  boundary  vanishes.  Thus,  the  temperature  gradient  GradO  and  the 
driving  force  /  are  the  “agents  of  entropy  production”  when  a  thermoelastic  body  is  not 
in  thermal  and  phase  equilibrium;  one  speaks  of  (GradO) JO2  and  f  jO  as  thermodynamic 
“affinities”  and  the  conjugate  quantities  q  and  Vn  in  the  entropy  production  rate  (2.5)  as 
the  corresponding  “fluxes”,  e.g.  Callen  [12],  Truesdell  [47].  Note  that  while  q  represents 
heat  flux,  pVn  represents  the  flux  of  mass  crossing  the  propagating  phase  boundary. 
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The  presence  of  non- vanishing  affinities  (Grad0)/02  and  / j0  indicates  a  lack  of  ther¬ 
mal  and  phase  equilibrium;  according  to  the  theory  of  non-equilibrium  processes  as  de¬ 
scribed  in  the  aforementioned  references,  the  evolution  towards  a  state  of  equilibrium 
is  controlled  by  constitutive  laws  which  relate  each  flux  to  its  conjugate  affinity,  and 
possibly  to  other  variables  as  well.  In  the  present  case,  this  suggests  the  need  for  two 
constitutive  relations:  q  =  Q(Grad6,  F,  9)  in  the  bulk  material  and 

Vn  =  V(f,0)  on  St,  (2.8) 

representing  a  heat  conduction  law  and  a  kinetic  law  respectively.  The  entropy  inequality 
(2.4)  can  be  written  equivalently  as 

fVn>0  on  St,  (2.9) 

and  so  the  constitutive  function  V  must  be  such  that 

V(f,0)f>  0.  (2.10) 

If  V  is  continuous,  this  implies  that  V^O,#)  =  0,  indicating  that  the  flux  Vn  also  vanishes 
when  the  body  is  in  phase  equilibrium. 

The  basic  principles  of  the  continuum  theory  do  not  impose  any  further  restrictions 
on,  or  provide  examples  of,  the  kinetic  response  function  V.  These  must  be  supplied  by 
appropriate  constitutive  modeling,  two  examples  of  which  are  given  in  Sections  3  and  6. 
In  the  special  case  of  processes  taking  place  close  to  phase  equilibrium,  it  is  natural  to 
replace  the  relation  (2.8)  by  its  linear  counterpart  Vn  =  u(0)f  with  u  positive. 

Energy-wells:  In  general,  a  propagating  surface  St  could  be  either  a  phase  boundary 
or  a  shock  wave.  Shock  waves  are  intimately  connected  with  inertia  and  do  not  exist  in 
the  absence  of  such  effects.  In  order  to  focus  attention  on  features  that  aie  unique  to 
phase  boundaries,  it  is  therefore  useful  to  consider  a  quasi-static  setting.  We  close  this 
section  with  some  discussion  on  the  characteristics  of  the  potential  energy  function  G , 

G(F;S,0)  =  /^(F,0)-S-F,  (2.11) 

associated  with  a  two-phase  material.  It  should  be  noted  that  while  the  Gibbs  free- energy 
g(F,0)  —  pil>(Y,0)  —  /u/>f(F,  6)  •  F  is  distinct  from  the  potential  energy  function  G,  their 
values  coincide  at  extrema  of  G(-;  S,  9). 

Let  St  represent  a  quasi-statically  propagating  phase  boundary,  and  consider,  and 
fix,  an  instant  t  and  a  point  x  on  St.  At  this  (x,  <),  let  n  be  a  unit  vector  normal  to 

St,  let  9  be  the  temperature,  and  let  F  and  S  be  the  limiting  values  of  the  deformation 
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gradient  tensor  and  the  first  Piola-Kirchhoff  stress  tensor  respectively.  Continuity  of  the 
deformation  requires  that  F=F  +b  <g>  n  for  a  suitable  vector  b;  continuity  of  traction 
requires  that  S  n  =S  n.  In  addition,  suppose  that  the  phases  on  the  two  sides  of  Si  are 

*  i 

metastable  in  the  sense  that  the  free-energy  V’(F,  $)  is  strongly  elliptic  at  F  =F;  see  [48]  for 
a  discussion  relating  strong  ellipticity  to  infinitesimal  stability.  Finally,  the  formula  (2.6) 

for  the  driving  force  simplifies,  in  the  present  quasi-static  setting,  to  /  =  p[[?/;]j  —  S  ■  [[F]]. 

We  now  consider  the  potential  energy  function  G( F;  S,  0)  and  examine  some  of  its  fea¬ 
tures.  Similar  results  hold  for  G(F;S,0)-  Consider  the  set  A  =  A(F,  n)  of  all  nonsingular 
tensors  which  are  kinematically  compatible  with  F: 

•4={F|F  =  F  +  a®n  for  all  vectors  a},  (2.12) 

and  consider  the  restriction  of  the  function  G(-;S,0)  to  the  set  A.  On  making  use  of 
the  traction  continuity  condition  and  the  constitutive  relations  S=  P0f(F,  0),  one  finds 
that  G,  as  a  function  of  a,  has  extrema  at  a  =  0  and  a  =  b  (corresponding  to  F  =F 
and  F  =F).  Moreover,  the  requirement  of  strong  ellipticity  can  be  used  to  show  that 
these  extrema  are  local  minima.  Figure  1  shows  a  schematic  graph  of  G(F;S,0)  versus 
F  showing  two  local  minima  (“energy- wells”)  at  F  =F. 

tG(F;S,©) 


Figure  1.  Schematic  graph  of  potential  energy  function  C  versus  deformation  gradient  tensor 

F. 
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Finally,  one  sees  that  the  difference  between  the  values  of  G  at  the  two  minima  F  =F 
is  equal  to  the  driving  force:  /  —G  —  G—9  —  9.  Suppose  that  the  minimum  at  F  =F  is 

less  than  the  minimum  at  F  =F  as  is  the  case  depicted  in  Figure  1;  then  /  is  positive, 
so  that  by  the  entropy  inequality  fVn  >  0,  one  has  Vn  >  0.  The  phase  boundary  St 
therefore  moves  into  its  positive  side,  creating  more  materia)  in  the  low-minimum  phase 
at  the  expense  of  material  in  the  high-minimum  phase.  Thus  the  phase  corresponding 
to  the  lesser  minimum  of  G  is  preferred.  The  same  is  true  when  /  is  negative.  Thus  we 
may  speak  of  the  phase  associated  with  the  lesser  minimum  of  G  as  being  stable  (or  at 
least  as  being  more  stable  than  the  phase  associated  with  the  higher  minimum). 

3.  A  CONSTITUTIVE  MODEL 

From  here  on  we  restrict  attention  to  a  purely  one-dimensional  setting  for  a  uniaxial  bar. 
In  the  present  section  we  describe  a  simple  explicit  constitutive  model,  consisting  of  a 
Helmholtz  free-energy  function,  kinetic  relation  and  nucleation  criterion.  Its  implications 
will  be  discussed  in  the  sections  to  follow. 

Helmholtz  free-energy  function:  Let  6C  be  the  critical  temperature:  when  0  >  0C 
the  material  exists  in  only  a  single  phase,  no  matter  what  the  stress-level.  On  the  other 
hand  for  0  <  0  <  6C,  we  assume  that  two  metastable  phases  exist  if  the  stress  a  lies  in  a 
suitable  range  (0)  <  a  <  <7^(0),  and  that  only  one  phase  exists  otherwise. 

The  potential  energy  function  G(-)\<t,  0)  —  —  07  that  characterizes  the  ma¬ 

terial  must  therefore  be  such  that  G(-;a,0)  has  two  local  minima  when  0  <  0  <  0C, 
<rm(0)  <  o  <  and  a  single  minimum  otherwise. 

This  implies  that  the  stress-strain-temperature  relation  a  =  <7(7, 0)  =  pip~,(^,0)  must 
have  the  following  properties:  for  0  >  0C,  a  must  be  a  monotonically  increasing  function 
of  7.  For  0  <  0  <  0C,  &  must  first  increase  with  7  on  a  range  —  1  <  7  <  7 a*(0),  then 
decrease  over  an  intermediate  range  7 M(0)  <  7  <  7 m(0),  and  finally  increase  again  for 
7  >  lm(9)-  We  shall  assume  that  <7  has  the  trilinear  form  shown  in  Figure  2: 


(  P7  -  pa{9  -  0T),  -1  <  7  <  7 m(0),  0  <  0  <  0C, 

a  =  0(7,0)  =  •{  (3.1) 

l  nil  ~  It)  -  fia{0  ~  0t),  7  >  7®(^)>  0  <  9  <  0C, 

where  we  have  omitted  displaying  the  equation  of  the  declining  branch.  Though  the 
figure  has  been  drawn  with  <rm(0)  negative,  this  need  not  be  the  case.  The  restrict  ion  7  > 
—  1  arises  from  the  requirement  that  the  deformation  be  one-to-one.  The  relationships 
between  <tm(9),  <rm(0)  and  7 m{0),  7 m{0)  niay  be  read  from  Figure  2.  We  take  ')m(O)  and 
Jm(0)  to  be  linear  functions  of  0  that  coincide  at  0  =  0r. 

The  two  rising  branches  of  the  stress-strain  curve  are  associated  with  distinct  phases 
of  the  material  which  we  refer  to  as  the  low-strain  phase  and  the  high-strain  phase.  In  the 
simple  model  here,  both  phases  have  the  same  constant  elastic  moduli  p  and  coefficients 


of  thermal  expansion  a.  The  constant  7r{>  0),  representing  the  horizontal  distance 
between  the  two  rising  branches  of  the  stress-strain  curve,  is  the  transformation  strain. 
We  require  9 r  to  be  the  transformation  temperature ,  so  that  both  phases  exist  and  both 
are  stable  when  9  —  &T,cr  —  0;  this  means  that  G(;0 ,<?•/•)  to  have  two  local  minima, 
and  that  the  values  of  G  at  these  minima  are  to  coincide. 


Figure  2.  The  trilinear  stress-strain  curve  at  a  fixed  temperature  6. 


By  integrating  the  relation  =  ar(-y,0)  with  respect  to  7,  and  enforcing  the 

requirements  that  (i)  ^>(7,$)  be  continuously  differentiable  for  7  >  —  1,  0  <  0  <  0C,  (ii) 
that  both  phases  have  the  same  constant  specific  heat  at  constant  strain  c  =  —04’ee{‘t,  0), 
and  (iii)  that  0j  be  the  transformation  temperature,  one  finds 


P^{  7>0)  =  ' 


f  72  -  pa~/{0  -  9r)  -  pcO  log{0/0T), 


(3.2) 

f  (7  -  It)2  -  -  7r)(0  -  &t)  -  pcdtog{0/0T)  +  ^f{9  -  0T ), 


where  (3.2)j  holds  for  the  low-strain  phase,  (3.2)2  for  the  high-strain  phase.  The  constant 
Aj  can  be  shown  to  represent  the  latent  heat  of  the  transition  at  the  transformation 
temperature  9j.  In  view  of  the  particularly  simple  manner  in  which  the  temperature  0 
enters  into  the  present  model,  one  would  not  expect  it  to  be  realistic  when  9  is  far  from 
the  transformation  temperature  0j. 

The  stress-level  cro(0),  defined  by 


tfo(0)  =  (p^tHtQt)^  -  0r)> 


(3.3) 


f>4 


is  known  as  the  Maxwell  stress  and  has  the  geometric  significance  shown  in  Figure  2 
where  the  two  shaded  areas  are  equal.  One  can  readily  verify  that  when  a  <  <ru(0 j,  the 
low-strain  minimum  of  the  potential  energy  function  G(-  ;o,0)  is  smaller  than  the  high- 
strain  minimum,  while  the  converse  is  true  for  a  >  <7o(0).  Thus  the  low-strain  phase  is 
stable  when  a  <  cr0($),  the  high-strain  phase  is  stable  when  a  >  ao{0 ),  and  both  phases 
are  stable  when  a  —  cro(0).  Observe  from  this  and  (3.3),  that  if  the  latent  heat  A y  >  0, 
the  low-strain  phase  is  stabie  at  high  temperatures,  while  the  high-strain  phase  is  stable 
at  low  temperatures;  thus  Ay  would  be  positive  when  the  low-strain  phase  is  ausUtnte 
and  the  high-strain  phase  is  martensite.  If  the  converse  is  true,  then  Ay  <  0. 


Figure  3.  Phase  boundary  velocity  as  a  function  of  driving  force  at  various  temperatures. 

Kinetic  relation:  Let  x  ~  s(t)  denote  the  location  of  a  phase  boundary  during  a 
quasi-static  process,  and  suppose  that  the  particle  at  x  —  s—  is  in  the  high-strain  phase 
and  the  particle  at  x  —  s-f  is  in  tin'*  low-strain  phase.  The  driving  force  /  on  this  phase 
boundary  can  be  calculated  by  substituting  (3.1)  and  (3.2)  into  /  =  {[/>0]J  —  <r [[-y] j,  thus 
leading  to 

/  =  f(°,0)  ~  1t[<J -MG)]-  (3.4) 

Since  the  bar  is  in  a  two-phase  state,  the  potential  energy  function  G(-;cr,0)  has 
two  local  minima,  one  at  7  =7,  the  other  at  7  —7,  and  they  are  separated  by  a  local 


maximum;  this  is  analogous  to  the  case  depicted  schematically  in  Figure  1.  As  the 
phase  boundary  moves  through  the  bar,  the  partiele  immediately  ahead  of  it  "jumps 
from  one  local  minimum  of  (1  to  the  other,  and  the  underlying  kinetic  relation  ran  be1 
constructed  by  viewing  this  jumping  process  at  the  atomic  scale.  In  particular,  if  this 
process  is  “thermally-activated”,  classical  arguments  based  on  the  idea  that  atoms  jump 
from  one  local  minimum  to  the  other  when  they  possess  sufficient  energy  to  cross  over 
the  intermediate  energy  barrier,  leads  to  the  following  kinetic  relation: 


i  =  v(a ,  0)  =  It  <  exp 


It 


\<r 


2rlc0  aA,(0)  -  a,n(0) 


—  exp 


2 r  W  -  VmjO)]1 

‘irkd  (j { 0)  -  rrm(0) 


,(3.5) 


where  R  and  r  are  positive  material  constants,  k  is  Boltzmann's  constant,  and  rr\i{0) 
and  <rm(0)  have  the  meaning  shown  in  Figure  2.  One  can  use  (0.1)  to  express  (3.5)  in  the 
alternate  form  ,s  =  V(f,0).  Graphs  of  \  '  —  \  j  li  versus  /'  --  ///*'// ,  at  different  vahu’s 
of  temperature  O'  =  (0  —  0j)/0f,  are  shown  in  f  igure  3.  One  can  show  that  V  {f,0)f  >0 
as  required  by  (2.10),  and  also  t hat.  G  is  an  increasing  function  of  /  at  fixed  0.  'I  he 
variation  of  V  with  temperature  0  at  fixed  /  is  more  complicated;  see  [7]. 

Nucleation  criterion:  Consider  a  process  in  which  the  entire  bar  is  in  a  single-phase 
configuration  for  an  initial  interval  of  time  and  in  Uvo-phase  states  for  subsequent  times. 
The  instant  of  time  and  the  point  in  the  bar  at  which  the  transition  from  the  single-phase 
to  the  two-phase  configuration  is  initiated  are  determined  by  a  nucleation  criterion. 

Addressing  first  the  question  of  location,  in  reality,  the  nucleation  site  is  strongly 
influenced  by  inhomogeneities.  Since  our  bar  is  uniform,  the  location  of  the  nucleation 
site  is  rather  arbitrary.  In  the  case  of  a  bar  rendered  inhomogeneous  by  means  of  a  slight 
uniform  taper  with  the  small  end  at  x  —  0,  the  low  strain  phase  to  high-strain  phase 
transition  would  necessarily  commence  at  x  =  0  and  the  reverse  transition  at  x  =  /..  We 
shall  arbitrarily  assume  that  this  is  the  case  in  our  uniform  bar  as  well. 

Turning  next  to  the  conditions  under  which  nucleation  occurs,  we  assume  that  each 
transition  is  initiated  at  a  critical  value  of  driving  force,  a  value  /.(>  0)  for  t  he  low-strain 
phase  to  high-strain  phase  transition  and  a  value  /..(<  0)  for  the  reverse  transition.  By 
making  use  of  (3.4),  one  can  restate  this,  for  quasi-static  processes,  in  terms  of  critical 
values  of  stress  and  temperature:  the  low-strain  phase  to  high-strain  phase  transition 
is  initiated  when  a  —  (T[ ,-,u{0)  while  the  reverse  transition  is  initiated  at  a  —  (Th^/XO) 
where 


*l-//(0)  =  (ph/rr0r)(0  ~  0 .),  <7 s  (rh/rrOrHO  -0„):  (3.6) 

0,  and  0„  denote  material  constants  which  may  be  readily  related  to  the  critical  nu¬ 
cleation  values  /.  and  /.»  of  driving  force.  Observe  that  when  the  stress  vanishes,  this 
nucleation  criterion  states  that  the  low-strain  phase  to  high  strain  phase  transition  is 
initiated  at  0  =  0 .,  the  reverse  transition  at  0  =  0 ...  Thus  when  A/-  >  0,  corresponding 
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to  the  case  where  the  low-strain  phase  is  austenite  and  the  high-strain  phase  is  marten¬ 
site,  9.  and  6 represent  the  “martensite  start  temperature”  and  the  “austenite  start 
temperature”  respectively:  6.  =  Ma,  —  A,.  The  converse  is  true  if  Ay  <  0. 

4.  THERMO-MECHANICAL  RESPONSE 

Consider  a  bar  composed  of  the  material  described  in  the  preceding  section.  Let  /, 
be  its  length  in  the  reference  configuration  and  6  its  elongation  from  this  state.  We 
consider  quasi-static  processes  during  which  the  bar  is  always  at  a  uniform  stress  a(t) 
and  temperature  9(t).  Suppose  that  the  temperature  9(t)  and  one  of  the  two  mechanical 
quantities  6(t)  and  a(t)  are  prescribed  and  that  we  wish  to  determine  the  remaining 
mechanical  variable. 

Suppose  that  initially,  for  0  <  t  <  £„,  the  entire  bar  is  in  the  low-strain  phase;  by 
(3.1),  the  elongation,  stress  and  temperature  are  then  related  by  6/ L  =  aj (i  A  a(9  —  0o). 
At  the  instant  t,,  the  high-strain  phase  is  nucleated  at  the  left  end  of  the  bar  so  that  by 

(3.6)!, 


<?(*.)  =  (p^t/7t0t)(0(*.)  -  &.]•  (4.1) 

For  some  time  thereafter,  the  bar  is  in  a  two-phase  configuration  with  the  high-strain 
phase  on  0  <  x  <  s(t),  the  low-strain  phase  on  s(<)  <  x  <  L;  by  (3.1),  one  now  finds 
that  6/ L  =  7ys/ L  +  cr/p  -f  ot(9  —  90).  Differentiating  this  with  respect  to  time  and  using 
the  kinetic  relation  (3.5)  to  eliminate  s  leads  to 

8/L  —  d/p  —  a0  =  (~fT/ L)v(cr,0)  (4.2) 

with  v(cr,9)  given  by  (3.5).  The  response  of  the  bar  during  this  second  stage  is  governed 
by  the  initial- value  problem  consisting  of  (4.2),  (4.1).  If  at  some  subsequent  time  the 
phase  boundary  reaches  one  end  of  the  bar,  then  the  bar  is  again  in  a  single- phase 
state,  and  its  response  can  be  determined  directly  from  (3.1).  If  the  bar  is  entirely 
in  the  high-strain  phase,  and  the  high-strain  phase  to  low-strain  phase  transition  is 
nucleated  at  some  instant  <»,,  the  initial  condition  (4.1)  must  be  replaced  by  a(tm.)  = 
(pAT/7ydy)[0(t„)  —  0„]. 

In  order  to  study  the  response  of  the  model  in  such  processes,  we  carried  out  some 
simple  numerical  computations  for  three  types  of  loading.  In  all  cases  the  bar  was  initially 
in  the  low-strain  phase.  The  specific  values  of  the  various  material  parameters  used  may 
be  found  in  [7];  in  particular,  we  took  Ay  >  0  corresponding  to  the  low-strain  phase  being 
austenite  and  the  high-strain  phase  being  martensite.  The  non-dimensional  quantities 
referred  to  in  the  results  shown  in  Figures  4-7  are  the  stress  a'  =  cr//i7y,  temperature 
9'  =  (0  —  9t)/9t ,  elongation  A'  =  A/^7y  where  A  is  the  elongation  from  the  initial 
configuration  (rather  than  from  the  reference  configuration),  and  the  eiongation-rate 
q‘  —  6/ Ryr-  Consider  the  following  three  loading  programs: 


Figure  4.  Mechanical  cycling  at  various  temperatures,  all  at  the  same  elongation  rate. 


Figure  5.  Mechanical  cycling  at  various  elongation  rates,  all  at  the  same  temperature. 


(i)  The  initially  stress-free  bar  is  loaded  isothermally  at  a  constant  elongation-rate 
8  >  0.  At  some  subsequent  time,  loading  is  replaced  by  unloading  at  the  rate  -8,  still 
at  the  same  temperature,  until  the  stress  returns  to  zero.  Figure  4  shows  the  result¬ 
ing  response  for  different  values  of  9  at  the  same  value  of  8.  Observe  that  permanent 
deformation  occurs  at  sufficiently  low  temperatures,  but  does  not  occur  at  sufficiently 
high  temperatures.  Note  also  that  the  principal  efTect  of  raising  temperature  is  to  shift 
the  hysteresis  loop  in  the  direction  of  increasing  stress,  without  significantly  altering  its 
vertical  thickness.  Figure  5  shows  the  effect  of  different  elongation-rates  6  at  the  same 
value  of  9.  Observe  that  at  sufficiently  slow  elongation-rates,  the  force  decreases  immedi¬ 
ately  after  nucleation,  whereas  at  higher  rates  the  force  increases.  In  Figures  5(a),(b),(c), 
unloading  commenced  before  the  entire  bar  transformed  to  the  high-strain  phase,  while 
in  Figure  5(d)  unloading  started  after  the  low-strain  to  high-strain  transition  was  com¬ 
plete.  These  observations  are  qualitatively  similar  to  experimental  observations  on  some 
shape-memory  alloys,  e.g.  AgCd  ([30]),  NiTi  ([36]),  CuZnSi  ( [38] )  and  CuAINi  ([33]). 

(it)  The  bar  is  loaded  thermally  at  constant  stress  a  by  initially  cooling  it  at  a  constant 
temperature- rate  9  <  0  and  later  heating  it  at  a  rate  —9.  Figure  6  shows  the  resulting 
response  at  different  values  of  stress  and  the  same  cooling/heating  rate  ]0j.  Observe 
that  the  effect  of  increasing  the  stress-level  is  to  shift  the  hysteresis  loop  in  the  direction 
of  increasing  temperature  without  significant  change  in  its  thickness.  This  feature  has 
been  observed  in  InTl  by  Burkart  and  Read  [11]  in  experiments  conducted  at  compn-ssive 
stress. 


Figure  6.  Thermal  cycling  at  various  stresses,  all  at  the  same  cooling  rate. 

(Hi)  Consider  a  stress-free  bar  at  the  transformation  temperature;  thus  initially,  a  = 
0,9  =  9t,8  =  0.  The  bar  is  now  subjected  to  a  four-stage  thermo- mechanical  loading 
process:  first,  it  is  stretched  isothermally  at  a  constant  elongation-rate  8  >  0  (loading 
to  the  curve  OAB  in  Figure  7).  After  the  bar  has  fully  transformed  to  the  high-strain 
phase,  it  is  then  mechanically  unloaded  back  to  zero  stress,  still  at  9  —  9j,  (curve  BC). 
The  bar  is  now  at  cr  =  0, 6  =  6 but  it  is  permanently  deformed  6  >  0.  From  hereon  the 
stress  in  the  bar  is  maintained  at  zero.  In  stage  three,  the  bar  is  heated  at  a  constant 
rate  9  >  0  until  it  is  fully  transformed  to  the  low-strain  phase  (curve  CDE).  Finally  it 


is  cooled  back  to  the  initial  temperature  &  =  0T  (curve  EG).  This  returns  the  bar  to  its 
original  state  a  =  0,0  —  Oj,S  —  0,  demonstrating  the  so-called  shape-memory  effect. 


5.  INERTIAL  EFFECTS 

We  now  consider  the  effects  of  inertia,  and  for  simplicity,  make  the  unrealistic  assumption 
that  the  processes  of  interest  take  place  isothermally ,  0(x,t)  =  constant  (<  0C).  All 
thermal  variables  can  now  be  suppressed,  the  energy  field  equation  and  jump  condition 
being  satisfied  a  posteriori  by  applying  a  suitable  distribution  of  heat  sources.  The 
presence  of  inertial  effects  permit  a  discontinuity  to  be  either  a  shock  wave  or  a  phase 

boundary:  the  strains  7  and  7  on  either  side  of  a  shock  wave  both  belong  to  the  same 
phase  of  the  material,  and  so  are  associated  with  the  same  branch  of  the  stress-strain 
curve;  in  contrast,  a  phase  boundary  is  associated  with  two  different  phases,  and  therefore 
with  two  different  branches  of  the  stress-strain  curve. 

Consider  a  bar  of  uniform  cross-sectional  area  whose  stress-strain  relation  is  cr  =  <7(7) 
with  <7  having  the  trilinear  form  shown  in  Figure  2.  Let  x  =  s(t)  denote  the  location  of 
a  strain  discontinuity  during  a  motion  of  the  bar.  The  one-dimensional  counterparts  of 
(2.1),  (2.2),  (2.6)  and  (2.9)  must  hold  at  each  such  discontinuity.  By  combining  the  first 
two  of  these,  one  finds 


fl2l  z 

7-7 


>  0. 


(5.1) 


From  (5.1)  and  (3.1)  one  sees  that  shock  waves  in  both  the  high-strain  and  low-strain 
phases  have  the  same  constant  propagation  speed  c  =  (p/pff.  In  addition,  the  linearity 


of  the  stress  response  function  <7(7)  between  the  two  strains  7  and  7  allows  one  to  infer 
that  the  driving  force  /  on  a  shock  wave  vanishes.  Thus,  in  particular,  the  entropy 
inequality  fs  >  0  is  automatically  satisfied  at  all  shock  waves  in  this  trilinear  material. 

Turning  next  to  phase  boundaries,  it  is  readily  seen  from  Figure  2  and  (5.1 )  that  the 
propagation  speed  |i|  of  a  phase  boundary  cannot  exceed  the  shock  wave  speed  r;  thus 
phase  boundaries  in  this  material  are  necessarily  subsonic.  (This  would  not  be  true  if 
the  two  phases  had  different  elastic  moduli,  say  pn  and  pt,  with  pn  y?  pi;  such  a  case 
is  discussed  in  [3],  demonstrating  that  subsonic  and  supersonic  phase  boundaries  behave 
quite  differently.)  The  kinetic  relation  is  to  hold  at  all  phase  boundaries.  The  results 
that  follow  hold  for  any  kinetic  relation  of  the  form  s  =  V(f)  with  V(f)f  >  0  and  V 
monotonically  increasing  (and  are  not  restricted  to  the  particular  form  (3.5),  (3.4),  which 
in  any  case  was  constructed  by  assuming  the  process  to  be  quasi-static). 

In  order  to  elucidate  the  distinct  roles  played  by  the  entropy  inequality,  the  kinetic 
relation  and  the  nucleation  criterion,  we  now  discuss  a  particular  initial-value  problem. 
We  consider  an  infinite  bar  and  seek  solutions  of  the  fields  equations  cr'( 7)7*  —  pvt  —  0 
and  7 1  —  vx  =  0,  subject  to  the  corresponding  jump  conditions,  the  entropy  inequality, 
the  kinetic  relation,  the  nucleation  criterion  and  the  (Riemann)  initial  conditions 


7(1,0+) 


-{ 


7ft, 


—00  <  x  <  0, 
0  <  x  <  00, 


v(x,0+) 


vL,  -00  <  x  <  0, 
v/i,  0  <  x  <  00. 


(5.2) 


The  initial  strains  and  velocities  7£,,7r,V£  and  vR  are  given  constants  and  we  restrict 
attention  to  the  case  where  7^  and  -yR  are  both  in  the  low-strain  phase.  Finally,  it  will 
be  convenient  to  introduce  a  parameter  h,  having  the  dimension  of  stress,  that  depends 
only  on  the  given  data: 


h  =  (p/2)  [7i  +  7r  +  (vr  -  vL)/c j . 


(5.3) 


We  seek  solutions  that  are  piecewise  smooth  and  invariant  under  the  scale  change 
x  — >  kx,  t  — *  kt ,  where  k  is  a  constant.  Such  solutions  must  have  the  piecewise  constant 
form  shown  in  Figure  8  where  7 j  and  Vj  are  the  constant  values  of  the  strain  ~i(x,i) 
and  the  particle  velocity  v(x,t)  in  theyfA-wedge  of  the  (x,  <)-plane.  This  solution  form 
involves  N  strain  discontinuities  on  lines  x  =  s}t  which  may  be  shock  waves  or  phase 
boundaries.  For  the  trilinear  material,  fans  cannot  occur.  Since  the  strain  and  velocity 
fields  corresponding  to  Figure  8  are  piecewise  constant,  the  governing  field  equations  are 
satisfied  automatically.  The  one-dimensional  versions  of  the  jump  conditions  (2  1),  (2.2) 
and  the  entropy  inequality  fs  >  0  must  be  enforced  at  each  of  the  N  discontinuities.  We 
momentarily  postpone  consideration  of  the  kinetic  relation  and  the  nucleation  criterion. 

Entropy  inequality:  The  jump  conditions  and  entropy  inequality  can  be  invoked 
to  show  that  any  solution  of  the  form  shown  in  Figure  8  must  necessarily  take  one  of  the 
two  simpler  forms  shown  in  Figure  9:  when  the  initial  data  are  such  that  h  >  <70,  where 
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a0  is  the  Maxwell  stress  given  by  (3.3),  there  is  a  two-panumt  ter  family  of  solutions  of 
the  form  shown  in  Figure  9(b);  the  two  parameters  are  the  velocities  .s  and  i,  of  the  two 
phase  boundaries.  On  the  other  hand,  when  —  ft  <  h  <  crM,  there  is  a  solution  of  the 
form  shown  in  Figure  9(a)  involving  no  phase  boundaries.  For  initial  data  in  the  common 
interval  a0  <  h  <  <jm ,  both  types  of  solutions  exist. 

This  reveals  two  distinct  types  of  non-uniqueness.  First,  there  is  the  lack  of  uniqueness 
arising  because  of  the  undetermined  velocities  of  the  moving  phase  boundaries;  this  is 
exemplified  in  the  first  case.  Second,  for  initial  data  such  that  cr0  <  h  <  <7 m,  two  types 
of  solutions  can  occur,  one  in  which  the  bar  ultimately  changes  phase,  the  other  in  which 
it  does  not. 

Kinetics,  nucleation  and  uniqueness:  We  now  indicate  how  the  nucleation  crite¬ 
rion  and  the  kinetic  relation  resolve  this  lack  of  uniqueness.  Within  the  class  of  solutions 
for  which  the  bar  changes  phase  (Figure  9(b)),  it  is  possible  to  show  that  the  kinetic- 
relation  s  =  V(f),  with  V  monotonically  increasing,  serves  to  determine  the  velocities  of 
the  two  phase  boundaries,  so  that  the  Rjemann  problem,  when  augmented  by  the  kinetic 
relation,  has  a  unique  solution.  The  only  remaining  issue  arises  for  values  of  h  in  the 
intermediate  range  (<70,<7m];  in  this  case,  it  is  required  to  choose  between  precisely  two 
solutions,  in  one  of  which  the  bar  changes  phase  (Figure  9(b)),  while  in  the  other  it  does 
not  (Figure  9(a)).  In  order  to  make  this  choice,  one  appeals  to  the  nucleation  criterion, 
according  to  which  a  phase  transition  from  the  low-strain  phase  to  the  high-strain  phase 
is  initiated  at  a  critical  level  /  =  /,  of  driving  force.  It  is  possible  to  show  by  invoking 
this  criterion  that  the  change-of-phase  solution  is  selected  for  initial  data  in  the  range 
&L-+H  <  h  <  <jM,  while  for  cr0  <  h  <  (*l—H  the  criterion  picks  the  solution  with  no  phase 
change;  here  is  the  nucleation  stress-level  defined  in  (3.6). 

6.  EFFECT  OF  VISCOSITY  AND  STRAIN-GRADIENT 

By  modeling  the  materia]  as  being  thermoelastic,  we  neglect,  in  particular,  the  effects  of 
viscosity  and  strain-gradients.  Such  effects  could  have  important  local  consequences  in 
the  vicinity  of  a  phase  boundary,  even  if  it  is  appropriate  to  neglect  them  when  describing 
the  bulk  behavior  of  the  material  elsewhere  in  the  body.  In  this  section  we  describe  the 
influence  of  such  effects  in  a  one-dimensional  isothermal  setting. 

Suppose  that  the  effect  of  viscosity  and  strain-gradient  on  the  constitutive  relation  is 
to  replace  the  elastic  stress-strain  law  a  =  <7(7)  by 

0  ~  ^(7)  +  Pvlt  —  P^lxxi  (6.1) 

where  the  constants  u  >  0  and  A  >  0  are  the  viscosity  and  the  strain-gradient  coefficient 
respectively,  and  <7  is  the  trilinear  function  shown  in  Figure  2.  A  propagating  phase 
boundary  in  the  elastic  bar,  associated  with  the  states  (7,  v)  and  (7,  e),  corresponds,  in 
this  augmented  theory,  to  a  travelling  wave  that  smoothly  connects  these  two  states: 


7  =  7(4),  v  =  t;(4),  (  =  x-st, 


(6-2) 


where  i  is  a  constant.  The  functions  *>{£),  t»(£)  are  to  be  twice  continuously  differentiable 
on  (—00,00)  and  must  satisfy  the  field  equations  oI  —  pvt  =  0,  7,  —  vr  =  0  and  (G.l), 
and  the  boundary  conditions  (6.3).  The  trilinearitv  of  a  allows  one  to  determine  o(£) 
and  ?.’(£)  explicitly,  though  we  shall  not  display  the  result  here. 

Such  travelling  waves  y(£),  e(4)  have  a  limit  as  the  viscosity  v  and  the  strain-gradient 
coefficient  A  tend  to  zero,  with  the  dimensionless  parameter  uj  —  and  the  data 

“  -f- 

7,  t%  7,  t>,  held  fixed;  indeed,  7(£),  v(£)  tend  to  7,  v  for  £  <  0,  and  to  7,  v  for  <f  >  0  in  this 
limit.  The  limiting  functions  7(£),  v(£)  satisfy  all  of  the  differentia!  equations  and  jump 
conditions  of  the  elastic  theory;  in  addition ,  one  finds  that  the  limiting  discontinuity 
satisfies  a  particular  kinetic  relation  s  =  V'(/;w’)-  This  kinetic  relation  characterizes  the 
influence  of  viscosity  and  strain-gradient  on  a  phase  boundary  even  when  these  effects 
are  negligible  in  the  bulk  material. 

*  f  f  ; 


Figure  10.  The  kinetic  relation  induced  by  viscosity  and  strain-gradient  effects. 

The  kinetic  relation  inherited  in  this  way  may  be  fully  and  explicitly  determined, 
though  its  representation  is  too  complicated  to  display  here.  Figure  10  describes  it  by 
plotting  ///0  vs.  s/c  where  f0  =  +  7m).  One  finds  that  f f fa  depends  only 

on  the  three  material  parameters  rr.v//(7m, 7Af/7m  and  u;;  the  figure  is  plotted  for  fixed 
values  of  the  first  two  of  these  and  for  various  values  of  uj.  The  kinetic  response  function 
V  increases  nynotonically  with  /  for  the  values  of  the  parameters  considered.  It  can  be 
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shown  that  V  satisfies  tl'e  requirement  V(/;u.>)/  >  0  imposed  on  every  kinetic  response 
function  by  the  entropy  inequality. 

The  effect  of  viscosity  and  strain-gradients  (cv  capillarity)  on  shock  waves  in  gas 
dynamics  is  well-known.  Here  too,  the  discontinuities  are  smoothed-out  and  replaced  by 
narrow  zones  in  which  the  fields  vary  continuously  but  rapidly.  In  the  limit  of  vanishing 
viscosity  and  capillarity,  one  recovers  the  particular  shock  wave  that  is  associated  with  the 
states  (7,v),  (7,v)  and  all  traces  of  viscosity  and  capillarity  disappear  in  this  limit.  In 
contrast  for  phase  boundaries,  the  limit  of  vanishing  viscosity  and  strain-gradient  effects 
leads  to  a  particular  discontinuous  solution  from  among  the  many  that  can  be  associated 

—  _  +  4. 

with  the  states  (7,v),  (7,i>),  the  particular  phase  boundary  selected  by  this  limiting 
procedure  being  the  one  that  obeys  the  aforementioned  kinetic  relation.  Thus  the  effects 
of  viscosity  and  strain-gradient  persist  (in  the  form  of  a  kinetic  relation  s  =  V /(/;u.’)) 
even  in  the  limit  v,  A  — *  0,  through  the  presence  of  the  parameter  w. 
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SUMMARY 

Instability  is  one  of  the  factors  which  limit  the  extent  to  which  structures 
can  be  loaded  or  deformed  and  plays  a  pivotal  role  in  the  design  of  structures 
loaded  in  compression  but  also  in  tension.  The  major  issue  of  practical 
concern  is  the  establishment  of  the  critical  buckling  load  and  deformation,  that 
is,  the  lowest  load  or  smallest  deformation  at  which  alternate  equilibria 
become  possible.  This  is  usually  established  through  linearized  bifurcation 
analyses.  In  practice,  the  critical  condition  can  be  affected  by  imperfections, 
material  nonlinearities  and  other  factors.  Incorporating  these  in  the  problem 
usually  requires  solution  of  the  more  complete  nonlinear  problems. 

This  classical  approach,  that  is,  concern  about  the  onset  of  instability,  is 
sufficient  for  the  design  of  most  structures.  In  the  past  fifteen  years,  a  class  of 
structural  instabilities  have  been  identified  in  which  the  classical  approach  is 
not  sufficient.  In  general,  these  affect  structures  of  larger  size  in  which, 
following  the  onset  of  instability,  collapse  is  confined  to  a  relatively  small  part 
of  the  structure,  that  is,  it  localizes.  However,  under  prevailing  conditions,  the 
initially  local  collapse  can  propagate,  or  spread,  often  in  a  dynamic  fashion, 
over  the  rest  of  the  structure.  The  load  required  to  propagate  such  instabilities 
is  often  substantially  lower  than  that  required  to  initiate  them  in  the  original 
structure. 

In  practice,  propagating  instabilities  are  initiated  from  local  imperfections 
which  locally  weaken  the  structure  and  cause  local  changes  in  its  geometry 
(collapse).  Very  much  like  a  set  of  dominoes  once  the  geometric  integrity  of 
such  structures  is  compromised  locally,  the  instability  has  the  potential  of 
spreading  over  the  whole  structure. 

The  aim  of  this  lecture  is  to  demonstrate  the  breadth  of  relevance  of  the 
phenomenon  by  pulling  together  results  from  studies  of  four  structural 
examples  which  exhibit  this  behavior.  Although  the  practical  relevance  of  the 
examples  varies  from  insignificant  to  very  significant,  each  demonstrates 
different  aspects  of  this  behavior. 

The  first  problem  is  that  of  finite  inflation  of  an  elastic  tube.  This  problem 
demonstrates  the  complexity  of  localization-type  instabilities,  which  in  this 
case  takes  the  form  of  a  bulge,  and  the  subsequent  propagation  or  spreading  of 
the  bulge.  The  problem,  once  understood,  can  be  solved  quite  adequately  with 
relative  ease  using  finite-deformation  membrane  theory.  Thus,  it  provides  a 
model  problem  for  introducing  some  of  the  features  of  this  type  of  structural 
behavior. 
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The  second  problem  is  that  of  propagating  buckles  in  offshore  pipelines 
used  for  transporting  oil  and  gas.  Under  some  circumstances,  a  local 
disturbance  can  initiate  a  collapse  which  can  propagate  dynamically  along  the 
length  of  the  line  and  totally  destroy  it.  In  this  case  the  problem  is  governed  by 
interaction  between  geometric  nonlinearities  with  inelastic  characteristics  of 
the  pipe  material.  The  understanding  of  many  aspects  of  this  problem  has 
over  the  last  fifteen  years  reached  a  certain  degree  of  maturity  and  the 
progress  achieved  to  date  will  be  reviewed. 

The  third  problem  is  that  of  localized  collapse  and  the  subsequent 
spreading  of  this  collapse  in  shells  lining  stiffer  cylinders.  This  problem  has 
the  additional  nonlinearity  of  contact.  It  can  occur  for  linearly  elastic  as  well 
as  elastic-plastic  shells.  The  fourth  problem  involves  a  long  panel  with  an 
arch  cross  section,  loaded  by  uniform  pressure.  It  will  be  demonstrated  that 
following  the  classical  onset  of  instability  the  panel  can  develop  a  localized 
buckle  which  then  can  propagate  at  a  relatively  low  pressure  and  collapse  the 
whole  structure. 

It  will  be  shown  that  the  underlying  common  characteristic  of  structures 
exhibiting  this  class  of  instabilities  is  a  nonlinear,  local  load-deformation 
response  with  two  branches  with  positive  slope  joined  by  an  intermediate  one 
with  negative  slope,  as  shown  in  Fig.  1.  Thus,  for  some  range  of  loads,  such 
structures  have  three  possible  equilibria  for  each  value  of  load.  The  one  on  the 
first  ascending  branch  will  represent  prebuckling  deformations.  The  one  on 
the  second  ascending  branch  will  represent  buckled  configurations.  Under 
some  conditions,  the  two  states  can  co-exist  in  the  same  structure. 

Details  of  each  problem  including  experimental  and  analytical/numerical 
simulations  as  well  as  a  comprehensive  literature  review  can  be  found  in  the 
review  article  in  the  reference. 


Figure  1.  Local  “load-deformation”  response  characteristic  of  this  class  of 
problems 
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Abstract 

The  paper  presents  computational  approaches  to  predict  the  onset  and  growth  of  such  plastic 
instability  phenomena  as  buckling,  necking  and  flow  localization.  The  regularization  schemes 
to  remedy  the  problems  associated  with  spurious  mesh  sensitivity  and  incorrect  convergence 
in  finite  element  prediction  of  flow  localization  are  discussed.  The  instability  of  thick  circular 
tubes  deformed  under  pressure  and  combined  loading,  neck  and  bulge  propagation  in  poly¬ 
meric  materials,  wrinkling  of  plates  and  shells  under  forming  processes,  flow  localization  of 
thermo-elasto-viscoplastic  materials  under  various  deformation  rates,  and  flow  localization  of 
mono-  and  polycrystalline  solids  are  briefly  reviewed. 


1.  INTRODUCTION 

Instability  phenomena  in  plastic  solids  under  deformation  present  themselves  in  various 
ways.  The  decrease  in  the  stiffness  due  to  the  geometrical  change  and/or  material  softening 
caused  by  the  deformation  is  responsible  for  the  occurrence  of  such  instability  phenomena  as 
buckling,  necking  and  shear  banding.  Because  the  formation  of  these  instabilities  is  an  impor¬ 
tant  precursor  to  collapse  or  failure,  the  computational  prediction  of  the  onset  and  growth  of 
these  instabilities  is  indispensable  to  understanding  the  final  strength  of  the  structures  and 
materials. 

The  purpose  of  the  present  paper  is  to  provide  an  introduction  to  computational  predictions 
relating  to  plastic  instability  in  solid  mechanics.  The  basic  equations,  including  the  virtual 
work  principle  and  the  weak  form  of  the  energy  balance  equation,  and  constitutive  equations 
covering  various  aspects  of  material  response  are  first  outlined.  To  clarify  the  instabilities 
associated  with  the  bifurcation  which  governs  the  load-carrying  capability,  and  the  forming 
limit  of  the  thin-walled  bodies,  the  specializations  and  extensions  of  Hill's  theory  of  unique¬ 
ness  and  bifurcation  [I  j  to  cover  these  specific  problems  will  be  discussed.  The  computational 
strategies  for  the  prediction  of  deformation  behavior  deep  in  the  postbifurcation  range  are 
given. 

With  regard  to  shear-band-type  localization,  since  the  deformation  process  manifests  itself 
as  autocatalytic,  and  the  problem  likely  becomes  an  ill-posed  one,  the  computational  results 
exhibit  an  inherent  mesh  dependence.  Several  adaptive  remeshing  schemes  to  resolve  the 
sharp  strain  gradients,  and  regularization  procedures  proposed  to  remedy  the  mesh-dependent 
features  of  the  computational  results  are  briefly  reviewed. 

As  illustrative  examples,  instability  behaviors  associated  with  the  thick  circular  tubes  under 
pressure  and  combined  loading,  instability  propagation  in  polymeric  materials,  and  wrinkling 
behavior  of  plates  and  shells  under  different  forming  processes  are  briefly  reviewed.  Subse¬ 
quently,  the  flow  localization  behavior  of  thermo-elasto-viscoplastic  material  under  various 
deformation  rates  will  be  presented.  We  close  with  direct  physical  approaches  to  flow 
localization  of  mono-  and  polycrystalline  solids  which  strongly  depend  on  the  evolution  of 
microstructure,  and  their  interaction. 
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2.  BASIC  EQUATIONS 
2.1  Governing  Equations 

Consider  an  equilibrium  state  for  a  body  with  volume  V  and  surface  S  subjected  to  a  veloci¬ 
ty  constraint  on  Sv  and  traction  on  St.  An  updated  Lagrangian  formulation  is  employed  where 
each  material  particle  is  labeled  by  a  set  of  curvilinear  coordinates  x'  w  ith  metric  tensor 
which  are  embedded  in  the  body  in  the  current  state.  The  weak  form  of  the  equation  govern¬ 
ing  the  rate  of  stress  and  traction  yields  the  virtual  work  principle  [1,2], 


(S'4  o‘iV 
V 


)  SVi;jdV= 


6VdS, 


(2.1) 


where  6V  is  the  virtual  velocity  satisfying  the  homogeneous  boundary  condition  over  sur¬ 
face  S  .  Here,  Slj  is  the  Kirchhoff  stress,  which  is  identical  to  Cauchy  stress  o'J  in  the  cur¬ 
rent  configuration.  An  overdot  denotes  a  material  time  derivative  and  { ) ,  denotes  the  covar¬ 
iant  derivative  with  respect  to  current  coordinates.  Generally  the  configuration-dependent 
part  of  traction  rate  P  is  a  homogeneous  function  of  degree  one  in  the  velocity  Vj  and  veloci¬ 
ty  gradient  V  [3]  which  has  a  potential  for  a  specific  case.  We  can  generalize  this  equation  to 
a  dynamic  case  and  a  case  with  residual  non  equilibrium  force  at  the  previous  step. 

Meanwhile,  the  weak  form  of  the  energy  balance  equation  for  the  same  body  subjected  to 
heat  flux  q=-niq‘=Q  on  S  with  the  surface  unit  normal  ni  and  temperature  constraint  on  ST  is 


f  6TpcT dV  +  (&T  ic'jT  dV=  [  &TawpdV+  f  dTQdS,  (2.2) 

J  V  Jv  ''  J  V  J  s 

s 

where  p  ,  c  and  k'1  are  the  mass  density,  specific  heat  and  thermal  conductivity  tensor, 
respectively.  Here,  6T  is  the  variation  of  the  temperature  which  satisfies  the  homogeneous 
boundary  condition  on  Sr  The  fraction  of  irreversible  work,  wp,  which  is  converted  to  heat  is 
a  ,  where  a  is  in  the  range  of  0.85  to  0.95  for  many  metals.  The  specific  form  of  heal  flux  Q 
depends  on  the  respective  boundary  conditions. 

2.2  Constitutive  Equations 

Since  instability  predictions  strongly  depend  on  the  assumed  material  response,  here  consti¬ 
tutive  equations  which  have  been  employed  in  instability  analysis  are  briefly  summarized.  In 
elastic-plastic  material  response,  the  plastic  part,  Dpw,  of  the  strain  rate,  Dkl=(Vk,|+V ,4/2,  is 
specified  through  various  kinds  of  constitutive  equations.  Elastic  material  response  takes  the 
form  of  a  linear  relationship  between  the  elastic  strain  rate,  Dej,=Dj.-Dpj.,  and  a  suitable 
objective  stress  rate,  Sw.  Then  the  elastic-plastic  constitutive  equation  can  be  expressed  by 

S^L^Dy,  (2.3) 


where  Lijki  is  the  constitutive  tensor  and  depends  on  the  current  stress  and  the  deformation 
histories. 

Within  the  rate-independent  response,  for  the  rather  wide  class  of  materials  which  obey  the 
flow  rule,  Dpr  can  be  expressed  by 


Dp..=nmnSmnrn../h, 

ij  mn  ij 


(2.4) 


where  n  ,  m.  and  h  are  the  normal  to  the  yield  surface,  direction  of  plastic  strain  rate  and 
hardening  modulus,  respectively.  The  constitutive  equations  due  to  J,  flow  [1,4,5]  and 
kinematic  hardening  [6]  theories  and  to  many  anisotropic  theories  (see  refs.7,8)  fall  into  the 
special  case  of  Eq.(2.4)  with  a  =nv  .  In  Rudnicki  and  Rice  |9],  for  material  where  n  *nr.  and 
Smn  is  replaced  by  6  mn,  the  sp^cifi^  form  of  a  and  m(j,  for  example,  can  be  determir/cd  ny  the 


Gurson-type  yield  function  [10,11].  Several  extended  versions  accounting  lor  the  kinematic 
hardening  [12],  the  change  in  elasticity  modulus  due  to  void  volume  fraction  j  13]  and  the 
interaction  of  voids  [14]  were  also  suggested. 

The  deformation-type  constitutive  equation  by  Budiansky  [15]  was  focused  on  as  the  sim¬ 
plest  model  which  permits  the  development  of  a  corner  on  the  yield  surface.  It  has  been 
generalized  to  account  for  the  finite  strain  ]  16-18]  and  anisotropy  [19],  and  extensively 
employed  for  buckiing-  and  shear-band-type  instability  analyses.  The  plastic  strain  rate  is 


Dp  =n  Smnn  /h+(S'  -n  S“n.  )/h 

n  nm  it  V  n  nm  u'  s 


(2.5) 


where  hs  is  a  new  hardening  modulus.  This  constitutive  equation  is  validated  for  the  deforma¬ 
tion  satisfying  the  total  loading  condition  [  1 5 j.  For  the  strongly  nonproportional  stress  histo¬ 
ries,  Christolfersen  and  Hutchinson  [20]  proposed  the  comer  theory  in  which  angular  meas¬ 
ure,  0  ,  of  the  stress  rate  direction  with  respect  to  comer  direction  of  the  yield  surface  is  de¬ 
fined.  The  plastic  strain  rate  can  be  expressed  by 


d2Ep 


1 


Dp  - - 

1J  as‘j  dsu 


Su,  EPe=-f(0)CijklSiiSld, 


(2.6) 


where  Cijkl  is  a  plastic  compliance  tensor  of  Eq.(2.5).  The  transition  function  f(0)  is  unity 
throughout  the  total  loading  range  O<0<0a  and  zero  for  0c<H<ji  .  Here,  f(0)  decreases  from 
unity  to  zero  as  0  increases  from  0(i  to  0c.  The  Bauschinge;  effect  [21]  and  anisotropy  [22] 
have  been  introduced  in  Eq.(2.6). 

Constitutive  descriptions  of  single-crystal  plasticity  in  terms  of  crystallographic  shearing  on 
specific  slip  systems  have  been  given  by  Asaro  and  Rice  [23],  and  Asaro  [24].  In  the  current 
configuration,  the  plastic  strain  rate  is  described  by 

DPpI  P6>..  yW,  2P(a)..=se(a,jme(a,.+me<a)ise<a)j,  (2.7) 

where  (a)  designates  the  crystal-structure-dependent  (a)th  slip  system  with  the  slip  direction 
s'™,  and  normal  ra^' Un  the  current  state.  Here,  y00  is  the  shear  rate  on  slip  system  (a),  and  the 
summation  in  Eq.(2.7)  extends  over  the  active  slip  systems.  The  constitutive  equation  for  the 
single  crystal  can  be  established  by  Eq.(2.7)  and  the  evolutional  equation  for  the  flow  stress 
on  slip  system  (a).  In  the  description  of  the  hardening  moduli,  self-hardening  and  latent 
hardening  are  introduced  to  duplicate  the  experimental  observations  [24].  Recently,  more 
elaborate  descriptions  of  latent  hardening  have  been  proposed  [25]. 

The  materials  generally  possess  strain  rate  and  temperature  sensitivities  to  some  extent,  and 
they  have  an  important  effect  on  instability  behavior.  Generalized  versions  of  the  above- 
mentioned  constitutive  equations  accounting  for  the  thermo-elasto-viscoplastic  response  of 
the  materials  have  been  established  [26,27]  and  employed  in  thermocoupled  instability  analy¬ 
ses. 

In  the  meantime,  in  order  to  avoid  numerical  instability  and  maintain  the  required  accuracy, 
suitable  integration  schemes  of  constitutive  equations  are  employed  [28-30].  Iterative  meth¬ 
ods,  however,  which  can  achieve  numerically  stable  computational  processes  with  large 
increment,  may  have  the  drawback  of  creating  problems  associated  with  the  exact  account  of 
the  deformation  history. 


3.  BIFURCATION  AND  POSTBIFURCATION  ANALYSIS 
3.1  Bifurcation  Analysis 

The  bifurcation  criterion  employed  is  Hill' s  theory  of  uniqueness  and  bifurcation  [  1  ]  for 
clastic-plastic  solids.  This  theory  states  that  the  solution  is  not  unique  when  a  nontrivial  solu¬ 
tion  can  be  found  for  the  eigenvalue  problem  given  by  the  following  variational  equation: 
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(Lijk'D*  D*  +  o'JV'VW*  dV-lp*'V’dS. 

iy  'i  U  V  >'J  J  s 


(3.1) 


Here,  (  )*  denotes  the  difference  between  the  fundamental  solution  and  the  second  one.  The 
surface  integral  of  Eq.(3.1)  arises  from  the  configuration  dependence  of  loading  and  Lljkl  is 
assumed  to  be  a  constitutive  tensor  for  a  linear  comparison  solid.  When  plastic  loading  con¬ 
tinues  throughout  the  plastic  zone  in  the  fundamental  solution,  the  bifurcation  point  for  under¬ 
lying  materials  is  also  given  by  Eq.(3. 1). 

Now,  when  the  bifurcation  functional  in  Eq.(3.1)  is  approximated  by  the  finite  element 
method,  we  can  arrive  at  the  following  homogeneous  algebraic  equation: 


[K]{  S  >0,  (3.2) 

A,  # 

where  {  6  }  denotes  the  values  of  V  .  at  the  nodal  points.  When  this  homogeneous  equation 
has  a  nontrivial  solution,  bifurcation  may  occur.  The  bifurcation  condition  becomes 


det[K]=0, 


(3.3) 


and  the  corresponding  mode  is  obtained  as  the  eigenmode  of  Eq.(3.2).  At  every  computation¬ 
al  step  of  fundamental  analysis,  the  change  of  the  sign  of  det[K]  is  checked  and  then  an  itera¬ 
tive  method  is  used  to  determine  the  vanishing  point  of  det[K]. 

In  the  course  of  the  deformation  analysis  of  the  block  and  bar,  Kitagawa  et  al.  [31,32]  found 
that  the  sign  of  the  determinant  of  the  stiffness  matrix  for  the  finite  element  model,  which  has 
the  capability  of  expressing  bifurcation  mode,  changes  at  the  bifurcation  points  on  the 
fundamental  path,  and  the  associated  eigenmode  is  the  corresponding  bifurcation  mode.  This 
has  been  successfully  applied  to  various  kinds  of  bifurcation  problems.  A  similar  and  extend¬ 
ed  discussion  can  be  seen  in  refs.  33-35. 

Next,  we  consider  an  axisymmetric  body  subjected  to  axisvmmelric  deformation.  Here,  x1 
and  x2  are  curvilinear  coordinates  in  a  meridian  plane  and  x3  is  circumferential.  Nonaxisym- 
metric  admissible  velocities  V*.  are  considered  in  the  forms 


V’nu=V*m!cos  mx-\  V‘m2=V’m2sin  mx3,  V^V’^cos  mx3,  (3.4) 

where  m  is  the  mode  number,  and  V*^.  is  the  function  of  x1  and  x2  and  independent  of  x3. 
Substituting  these  into  Eq.(3.1),  we  arrive  at  the  bifurcation  functional  for  mode  m  by 


(L'^D*  D‘  +  o^*‘  ,V\,)rdA— f  P*1 
IJ  ’  l,J  J  c 


V*  rdS, 

mi  ’ 


(3.5) 


where  BV  and  VV  can  be  described  by  V*  and  its  derivatives  with  respect  to  x1  and  x2, 
and  mode  number  rn,  and  so  on.  Here,  r  is  tne  radial  distance  of  a  generic  point  in  the  axi¬ 
symmetric  body.  A  and  S  are  the  cross-sectional  area  of  the  body  and  its  boundary, 
respectively.  ^ 

The  finite  element  approximation  is  then  employed  to  describe  V*mi  over  an  element,  and 
the  corresponding  bifurcation  condition  in  Eq. (3.1)  becomes 


det[KJ=0(m=l,2,3...).  (3.6) 

The  first  bifurcation  point  is  referred  to  as  the  critical  bifurcation  point,  and  the  corresponding 
mode  is  the  critical  bifurcation  mode. 

When  the  materials  follow  the  nonassociative  flow  law,  the  bifurcation  condition  in  Eq.(3.1) 
becomes  invalid  because  of  the  nonsymmetry  of  tensor  L1Jkl  in  Eq.(2.3).  Raniecki  and  Brunhs 


[36]  introduced  two  comparison  solids,  through  which  the  search  for  the  genuine  bifurcation 
state  is  replaced  by  a  search  for  upper  and  lower  bounds.  Investigation  concerning  the  im¬ 
provement  of  the  lower  bounds  can  be  seen  in  (37|.  Furthermore,  for  materials  obeying  the 
nonlinear  constitutive  equation,  Tvergaard  |38j  and  Trianlafyllidis  [39|  proposed  methods  to 
determine  the  upper  and  lower  bounds,  respectively,  for  the  first  bifurcation.  See  also  the 
recent  paper  [40j. 

3.2  Wrinkling  Analysis  of  Thin-Walled  Bodies 

Consider  a  bifurcation  into  a  more  complicated  membrane  and  bending  type  of  deformation 
from  membrane  deformation.  Any  general  point  on  a  thin-walled  body  can  be  identified  by 
(yI»y3)(I— 1»2)  where  (y',y2)  are  curvilinear  coordinates  on  the  middle  plane,  and  the  y3  axis  is 
perpendicular  to  the  plane.  Now,  let  v.  and  V  be  the  velocities  of  the  body  at  (y‘,0)  and 
(y’,y3),  respectively.  According  to  Naghdi's  thtn-shell  theory  [41],  velocity  V.  can  be  ex¬ 
pressed  as 

VW+y3  S'  ,  V3=v3+y3  6\  (3.7) 

where  y36  describes  the  director  or  relative  velocity  at  a  point  (y\y3)  relative  to  the  velocity  at 
point  (y\(i).  Specifically,  the  Kirchhoff-Love  hypothesis  for  thin  shells  provides  the  follow¬ 
ing  relationship: 

6r-(v3.i+bLivi >  6-r0’  (3-8) 

where  b,,  is  the  curvature  tensor  of  the  middle  plane  and  vanishes  for  flat  plates.  When 
Eq.(3.7)  and  the  corresponding  velocity  gradient  and  strain  rate  are  substituted  into  Eq.(3.1) 
with  P*'=0,  the  bifurcation  functional  for  this  problem  becomes 


I.  |]JtrUMNd'ud*MN+4L13N3d‘13d‘N3+  o“(v"  M.»'  .„)} 

+t3/12{LlJMN  k’jj  k,mn+4L!3N3  k',3k  *  N3  +  «“( to*1  M  u)*u  +  w  *3M  u/y)}  |dA. 


(3.9) 


The  details  of  the  notations  are  explained  in  Tomita  and  Shindo  [42].  Tensor  L'^kl  is  the  plane 
stress  (o33=0)  constitutive  tensor,  where 

L'Jkl=L|Jkl-Llj33Lkl33/L3333,  (3. 10) 

which  is  uniform  throughout  the  thickness  because  of  the  assumption  of  a  membrane  slate 
prior  to  bifurcation.  Adopting  the  Kirchhoff-Love  hypothesis  in  Eq.(3.8)  for  the  bifurcation 
functionals  in  Eq.(3.9),  we  can  get  the  simplified  functional  of 

l,-|jl!L“MNd-1Jd-MN+oM3(v-|Mv'LI+v-M,v'J,)} 

+t3/12{LUMNK*u  k’mn+  o*"(  w  *'  M  u>*, U)*3M  u/y)}  ]d A.  (3. 1  1 ) 


Since  the  in-  and  out-of-plane  deformation  modes  are  coupled  with  each  other,  the  necking 
type  of  mode  is  affected  by  the  bending  type  of  mode  and  vice  versa.  On  the  other  hand,  for  a 
plate,  we  should  recall  that  b,j=(),  and  we  can  arrive  at  the  decoupled  bifurcation  functionals 
[42]. 

Due  to  the  complexity  ot  constructing  a  linite  element  which  satisfies  the  C1  continuity 
requirement  for  a  general  shell,  the  functional  in  Eq.(3.9)  is  recommended  for  use  in  such 
problems.  One  exceptional  case  is  the  problem  of  an  axisymmctric  shell  which  has  been 
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discussed  [42].  The  search  for  the  bifurcation  point  is  the  same  as  that  explained  before. 

3-3  Postbifurcation  Analysis 

The  solution  of  the  boundary  value  problem  at  the  bifurcation  point  can  be  expressed  by  the 
sum  of  the  fundamental  solution  at  the  bifurcation  point  and  a  suitably  normalized  bifurcation 
mode.  As  discussed  by  Hutchinson  [43),  the  specific  amplitude  of  the  bifurcation  mode  is 
determined  such  that  loading  occurs  everywhere  in  the  current  plastic  zone,  except  at  one 
point  where  neutral  loading  takes  place.  The  highly  nonlinear  nature  of  the  postbifurcation 
behavior  can  be  clarified  by  employing  the  virtual  work  principle  with  the  finite  element 
approximation.  When  the  bifurcation  point  obtained  from  Eq.(3. 1)  is  not  the  real  bifurcation 
point,  the  postbifurcation  behavior  must  be  traced  by  the  specially  developed  method  [40]  or 
calculated  approximately  using  bodies  with  initial  imperfections  through,  again,  the  virtual 
work  principle.  The  proper  magnitude  of  imperfection,  which  depends  on  the  problems  and 
the  number  of  significant  figures  of  the  employed  computational  facility,  and  of  the 
incremental  step  must  be  introduced  to  suitably  predict  bifurcation  and  postbifurcation 
behavior  [31,32,44], 


4.  FLOW  LOCALIZATION  ANALYSIS 

4.1  Problems  Associated  with  Flow  Localization  Analysis 

Once  the  flow  localization  into  narrow  bands  of  intense  straining  starts,  problems  associated 
with  the  incompressibility  of  the  materials,  and  an  excessive  distortion  of  the  element  due  to 
high  deformation  gradients  appear.  Furthermore,  due  to  the  ill-posedness  of  the  problem,  the 
numerical  solution  of  localization  problems  for  a  rate-independent  solid  exhibits  an  inherent 
mesh  dependence  [45 1. 

The  problem  associated  with  the  incompressibility  of  the  materials  was  comprehensively 
discussed  by  Nagtcgaarl  et  al.  [46],  who  proposed  the  well-known  crossed  triangular  element. 
Subsequently,  many  elements  with  reduced  or  selective  reduced  integrations  [47|  and  with 
hourglass  control  [48]  have  been  developed.  The  aspect  ratio  of  the  quadrilaterals  (crossed 
triangular  elements)  is  chosen  so  that  they  deform  by  alignment  with  the  most  critical  shear 
band  direction  [49],  By  contrast,  in  the  conventional  quadrilateral  elements,  suitable  hourglass 
control  must  be  introduced  to  capture  a  sharp  shear  band  [29|. 

Regardless  of  the  elements  employed,  the  elements  in  the  localized  zone  are  excessively 
distorted  and  the  performance  of  the  approximation  of  the  velocity  distribution  becomes  poor. 
For  this  problem,  Ortiz  and  Quigley  [50]  proposed  the  adaptive  h-method  in  which  the  re¬ 
finement  of  the  mesh  is  performed  in  specific  regions  such  that  the  variation  of  the  solution  is 
within  a  prescribed  tolerance  over  the  element  in  the  mesh.  An  adaptive  remeshing  algorithm 
to  produce  constant  error  finite  elements  was  also  proposed  [51 J.  In  the  adaptive  method,  a 
proper  transfer  operator  must  be  defined  to  correctly  redistribute  the  variables  for  the  new 
mesh  [50].  However,  the  problem  associated  with  mesh  dependence  still  remains  an  open 
question.  This  will  be  discussed  next. 

4.2  Localization  Analysis  with  Regularization 

The  problem  associated  with  the  ill-posedness  can  be  resolved  by  introducing  a  suitable 
length  scale.  The  viscoplastic  regularization  is  aimed  at  introducing  the  length  scale  by  the 
strain  rate  sensitivity  of  the  material  to  make  the  original  problem  into  a  well-posed  one 
[45,52,53].  Without  regularization,  the  widths  of  the  computed  shear  bands  are  set  by  the 
mesh  spacings,  as  observed  in  many  localization  analyses.  However,  for  mesh  sizes  which 
appropriately  resolve  the  localization  with  suitable  regularization,  convergence  occurs  with 
the  refinements  [53  j.  Thus  viscosity  can  be  a  regularization  parameter  from  a  computational 
point  of  view  or  a  micromechanical  parameter  to  be  determined  from  observed  shear-band 
width  from  a  physical  point  of  view  [53  j.  The  possibility  of  the  existence  of  a  stable  stationary 
shear  band  of  a  finite  thickness  has  been  shown  [35|.  Nevertheless,  when  a  flow  localization 
dcvelopes  sufficiently,  the  localization  zones  again  exhibit  mesh  dependency  ( 35,54,55  ]. 


The  length  scale  can  be  naturally  introduced  by  employing  the  nonlocal  continuum.  When 
the  magnitude  of  inelastic  strain  rate  Dp  at  location  x  is  defined  by  spatial  averaging  while  the 
elastic  behavior  is  treated  as  local  [54],  the  elastic-plastic  constitutive  equation  can  be  ex¬ 
pressed  by  Eq.(2.3).  Then  the  same  finite  element  model  can  be  used  [56).  For  damaging 
materials,  Bazant  and  Cabot  1 56]  defined  the  nonlocal  damage  parameter  expressed  by  spatial 
averaging  as  in  the  plastic  strain  rate  Dp  and  succeeded  in  the  substantial  reduction  of  the 
mesh  sensitivity  of  the  flow  localizations. 

Aifantis  [57]  introduced  higher  strain  gradient  terms  in  the  flow  stress  to  represent  the 
microstructural  dependent  nature  of  the  material  response.  This  provides  a  natural  length  scale 
to  the  problem  and  makes  it  possible  to  investigate  the  structure  of  shear  bands  (58,59].  Using 
strain-gradient-dependent  flow  stress  for  the  thermal  softening  material 

o  =  o(  e,T)  -  cV2F,  (4. 1 ) 


Tomita  and  Nakao  [60,61]  showed  that  the  finite  element  predictions  of  the  evolution  of  shear 
band  thickness  and  strain  distribution  can  duplicate  the  analytical  results  [59]  without  mesh 
sensitivity.  Figure  4.1  shows  the  finite  element  mesh-size-independent  prediction  of  shear 
localization  and  the  specimen-size-independent  nature  of  shear  banding  (60,61).  It  has  been 
proven  numerically  that  the  incorporation  of  the  length  scale  in  a  different  manner  equally 
eliminates  the  mesh  sensitivity  of  the  flow  localization  process  when  the  length  scales  are 
suitably  estimated  [62]. 
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Figure  4.1  Representative  strain  distribution  in  the  shear  hand,  (a)  Computational  model,  (b) 
Effect  of  mesh  division  with  specimen  size  I,  (c)  Effect  of  specimen  size  with  mesh  division  C, 
c:  positive  parameter  in  Eq.(4. 1).  (from  [60]). 


Numerical  simulations  using  a  Cosserat  continuum  |63j  and  a  constitutive  equation  of  dipo¬ 
lar  materials  1 64 [  clarified  that  the  strain  profile  over  the  localization  zone  and  the  magnitude 
of  energy  required  for  deformation  converge  to  certain  unique  values  for  static  as  well  as  for 
transient  loading. 

It  is  noted  that  a  more  rigorous  investigation  concerning  the  introduction  of  the  higher-order 
gradient  term  is  necessary  [65].  Furthermore,  the  problem  associated  with  the  identification  of 
length  scales  is  still  open.  It  would  be  natural  to  relate  them  to  the  size  of  the  real  localization 
zone. 
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To  resolve  the  sharp  strain  gradients,  it  has  been  proposed  to  add  a  higher-order  deformation 
mode  to  the  conventional  finite  element  {66.67].  In  the  element,  velocity  Fields  containing  a 
shear  band/  plane  can  be  expressed  by 

Vf2  Nvia  +  Z  Mjh  yb,  (4.2) 

where  the  first  term  is  a  C°  finite  element  interpolation  and  the  second  stands  for  the  incom¬ 
patible  term  expressed  by  the  localized  mode  amplitude  yh  and  the  shape  function  Mjb.  The 
direction  of  the  shear  band/plane  is  estimated  at  the  Gauss  integration  points  by  the  shear  band 
analysis  ]68]  and  the  linear  perturbation  analysis  (69]  for  strain-rate-independent  and 
-dependent  materials,  respectively. 

When  the  thickness  of  the  shear  band  r  smaller  than  the  size  of  the  element,  an  embedded 
formulation  where  the  bandwidth  or  the  profile  of  the  strain  fields  is  assumed  to  be  known  a 
priori  [70]  restores  the  wcll-poscdness  of  the  problem  for  strain-rate-independent  materials. 
For  strain  rate -dependent  materials,  since  the  problem  is  wellposed,  the  enriched  formulation 
[71]  with  a  spectral  interpolation  is  used  to  approximate  the  localized  strain  fields.  Then  the 
mesh  dependence  is  substantially  reduced  in  the  embedded  element  and  when  used  for  a 
coarse  mesh,  it  predicts  the  effective  strain  much  better  than  the  standard  element.  Recently, 
the  ability  of  different  finite  element  enhancements  to  capture  localization  was  discussed  [72]. 


5.  PLASTIC  INSTABILITIES  IN  SPECIFIC  PROBLEMS 

5.1  Instability  Behavior  of  Circular  Tubes 

The  problem  of  instabilities  in  elastic-plastic  tubes  has  been  invesegated  by  a  number  of 
authors.  The  instability  at  the  maximum  pressure  point  is  overcome  by  volume  loading  [73], 
The  bifurcation  points  for  internal  pressure  [74,75]  and  external  pressure[76]  under  axially 
plane  strain  conditions,  and  for  combined  loading  of  internal  pressure  and  axial  force  [44,77] 
have  been  obtained  by  employing  Eq.(3.5). 

The  initial-to-medium  postbi  furcation  behavior  [75,76]  and  localization  of  the  deformation 
accompanied  by  shear  bands  [78]  have  been  clarified  under  axially  plane  strain  conditions. 
Tomila  et  al.  [22,44]  investigated  the  loading-path-dependent  bifurcation  and  postbi furcation 
behavior  of  tubes  subjected  to  axial  tension  and  internal/external  pressure.  The  influence  of 
axial  loads,  the  Bauschinger  effect  and  corner  structure  on  the  yield  surface  on  the  onset  and 
growth  of  surface  instability  and  shears  band  were  investigated.  In  these  analyses,  to  avoid 
instability  in  the  numerical  analysis  in  the  vicinity  of  the  maximum  pressure  point,  computa¬ 
tions  were  performed  under  displacement  control  [79],  Quite  recently,  Tvcrgaard  [80]  predict¬ 
ed  the  nonaxisymmetric  modes  leading  to  localized  necking  and  the  subsequent  shear  fracture 
on  one  side  of  the  tube  [78]  in  connection  with  secondary  bifurcation  points  after  the 
development  of  an  axisymmetric  localized  bulge. 

When  an  elastic-plastic  circular  tube  is  drawn  axisymmetrically  through  a  frictionless  die, 
bifurcation  with  short  circumferential  wavelength  over  the  traction-free  inner  surface,  a 
surface-type  bifurcation  [81,82],  or  long  wavelength  mode,  becomes  possible  [83).  As  the 
tube  deforms  further  beyond  the  bifurcation  point,  the  surface  wave  will  grow  into  a  consider¬ 
able  surface  undulation  and  ultimately  may  lead  to  the  development  of  a  shear  band  connect¬ 
ing  the  highly  strained  regions  beneath  the  traction-free  surface  in  a  zigzag  fashion  [84].  In 
the  composite  tube  [85 1,  besides  the  surface-type  bifurcation,  the  boundary -type  bifurcation 
with  large  mode  amplitude  at  the  boundary  plane  of  two  materials  appears.  Figure  5.1  clari¬ 
fies  that  the  boundary-type  bifurcation  may  cause  strong  strain  localization  which  starts  near 
the  plane  and  extends  on  either  side  of  the  plane. 

5.2  Instability  Propagations  in  Polymeric  Material 

The  mechanical  aspects  of  instability  propagation  in  polymeric  material  have  recently  re¬ 
ceived  much  attention.  Hutchinson  and  Neale  [86]  and  Chaler  and  Hutchinson  [ 87 ]  clarified 
the  characteristic  feature  of  the  propagations  of  necking  in  tension  blocks,  bulging  in  balloons 


and  buckling  in  tubes  under  lateral  pressure  in  terj.is  of  simplified  analyses.  In  further 
studies,  full  finite  element  analyses  for  solid  circular  bars  (MS),  plane  strain  blocks  |89j  and 
tubes  (g0|  have  been  conducted.  Figure  5.2  shows  quasi-static  neck  and  bulge  propagations 
with  respect  to  circumferential  and  axial  directions,  respectively,  in  cylinders  under  plane 
strain  and  axisymmetric  deformation  conditions,  respectively,  with  internal  pressure.  It  is 
noted  that  the  thickness  nonuniformity  induced  by  neck  propagation  along  the  circumferential 
direction  diminishes  when  the  propagation  of  axisymmetric  necking  and  bulging  begins.  For 
further  discussion  concerning  buckle  propagation,  see  Kyriakides  |9T). 


Figure  5.1  Representative  strain  distribution 
for  different  stages  of  deformation. 
c2:  average  strain  (from[85]). 


Figure  5.2  Neck  and  bulge  propagations  of 
circular  tubes  under  internal  pressure  p. 
Plastic  strain  (left)  and  its  increment  (right) 
distributions  (from  [90]). 
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Figure  5.3  Thermo-elasto-viscoplastic  neck  propagation,  (a)  Neck  profile,  (b)  Temperature 
distribution  along  tension  axis  y.  u:  end  displacement,  2L:  initial  length.  T  initial  tempera¬ 
ture,  Average  strain  rate:  2*10  s0,  AD:  Locally  adiabatic  deformation  (from  ,94|). 


In  a  subsequent  study,  the  effects  of  strain  rate  sensitivity  |92|  and  the  temperature  depend¬ 
ence  (93,94)  on  the  neck  propagation  behavior  were  investigated.  Figure  5.3  shows  the 
thermo-elasto-viscoplastic  neck  propagation  behavior.  At  a  low  rate  of  deformation,  stabi¬ 
lization  effects  by  upturns  in  the  uniaxial  stress-strain  relation  overcome  the  destabilization 


effect  due  to  thermal  softening,  and  neck  propagation  with  a  heat  source  can  be  observed.  The 
deformation-induced  heating  and  its  conduction  strongly  affect  the  neck  propagation  behavior 
for  a  relatively  low  rate  of  deformation;  predictions  with  adiabatic  assumptions  do  not  provide 
proper  results. 

Due  to  the  nonuniform  and  three-dimensional  nature  of  the  deformation  behavior  of  poly¬ 
meric  material,  identification  of  the  constitutive  response  from  the  load  elongation  curve  is 
quite  difficult  [95 1  and  requires  both  precise  experiments  and  computational  simulations  (94  j. 
Furthermore,  anisotropy  caused  by  the  microscopic  mechanisms  of  the  molecular  chains  and 
to  the  distribution  of  their  orientations  due  to  the  plastic  deformation  is  quite  important  and 
can  be  characterized  by  the  molecular  orientation  developing  with  plastic  straining  [96J. 


53  Wrinkling  Formation  During  Forming  Processes 

In  connection  with  deep  drawing  of  circular  cylinders,  the  nonaxisymmetric  wrinkling  behavior 
of  a  flat  flange  was  first  investigated  by  employing  the  simplified  bifurcation  functional  (3.1 1) 
with  by=0  in  conjunction  with  admissible  velocities  v’(  (3.4)  |97).  Subsequently,  it  extended  to 
the  wrinkling  behavior  of  general  axisymmetric  shells  [98|.  The  same  or  similar  procedures 
have  been  employed  to  clarify  the  effect  of  material  characteristics  and  the  restraining  effect  of 
the  blank-holding  force  [97,99,100],  die  angles  [97J,  normal  anisotropy  and  details  of  die 
geometry  (98)  on  the  onset  of  wrinkling  during  the  deep  drawing  processes.  Figure  5.4  shows 
nosing  ratios  xw  an<^  Xwc  at  wr'n^'ng  and  critical  wrinkling  points,  respectively,  versus 
the  initial  thickness  to  mean  radius  ratio  (t0/R0),  and  the  number  of  wrinkles  m  for  different 
die  semi-angles  a.  The  typical  wrinkle  modes  are  illustrated  in  the  figure. 


Figure  j.j  wrintcung  oenavior  oi  square  sneei  unuer  uia^unai  tension,  v.ompuiauonai 
model,  X,  Y:  orthotropic  axes,  (b)  Wrinkling  modes,  w’,  6  x,  6*  :  deflectiona!  velocity,  rota¬ 
tional  velocities  around  y  and  x  axes,  respectively,  (c)  Effect  of  work  hardening  exponent  n, 
yield  stress  o  ,  normal  anisotropy  T  and  tensile  direction  M'  in  (a)  on  wrinkle  height  h  at 
e=0.02  (from  [^1J). 


In  stretch  forming  with  a  draw  head,  the  fundamental  deformation  path  includes  the  strong 
deviation  from  the  proportional  loading  path  or  unloading,  and  the  bifurcation  analysis  men¬ 
tioned  in  section  3.1  with  corner  theory  |20|  is  performed  [39|.  The  strong  deviations  from 
proportional  loading  have  an  important  stiffening  effect  in  the  material,  and  thus  the  onset  of 
wrinkling  is  considerably  delayed  as  compared  with  that  predicted  by  the  deformation  theory 
[39].  Furthermore,  the  results  clarified  that  the  draw  bead  restrains  puckering  and  promotes 
the  onset  of  localized  necking,  which  shows  good  agreement  w  ith  those  of  experiments 
(1011- 

Stretching  a  square  sheet  along  one  ot  its  diagonals  (see  Fig.5.?),  which  has  been  developed 
to  simulate  the  wrinkling  behavior  in  press  forming  of  auto  body  panels,  has  been  investigated 
[19,21]  by  employing  the  simplified  bifurcation  functional  in  Eq.(3.9)  with  by=0  in  conjunc¬ 
tion  with  corner  theory  j 20 J  with  Hill's  orthotropic  yield  function  |  i()2|.  A  substantial  delay 
of  wrinkling  is  observed  when  the  plate  is  stretched  to  the  minimum  r  value  direction.  The 
effect  of  the  oilhotropy  of  a  material  on  wrinkling  behavior  is  suppressed  as  compared  with 
that  predicted  with  the  flow  theory  type  of  constitutive  equation  [  19). 

To  investigate  the  growth  rate  of  wrinkles  and  the  residual  height  of  wrinkles,  three- 
dimensional  shell  analysis  with  a  slight  initial  imperfection  which  has  a  similar  shape  to  that 
of  the  wrinkling  mode  shown  in  Fig.5.5(b)  has  been  performed.  Figure  (e)  shows  dependency 
of  wrinkle  height  h  on  material  parameters  and  tensile  direction.  The  same  tendency  is  ob¬ 
served  for  residual  wrinkle  height.  The  same  problems  have  been  recently  traced  j  1 03 ). 


F:gure  5.6  Wrinkling  behavior  of  rolled  thin  sheet  metal.  Wrinkling  points  (left)  and  Wrin¬ 
kling  modes  (right)  (from  (104|). 


The  same  procedure  may  be  applied  to  the  wrinkling  behavior  in  sheet  metal  rolling  in 
Fig.5.6.  Wrinkles  at  45  degrees  and  0  degree  with  respect  to  the  rolling  direction  are  called 
cross  and  vertical  buckles,  respectively.  Figure  5.6  shows  that  the  compressive  force  due  to 
friction  or  shear  force  induced  by  the  slight  inclination  of  the  work  roll  acting  at  both  ends  of 
the  sheet  is  responsible  for  the  formation  of  these  wrinkles  [104].  A  different  type  of  wrin¬ 
kling  is  caused  by  nonuniform  tension  due  to  rolling  with  a  crown  roll  1 104). 

As  discussed  in  section  3.2,  since  the  interaction  between  the  wrinkling-  and  necking-type 
bifurcations  may  be  automatically  introduced  for  sheets  with  curvature,  the  effect  of  sheet 
geometry  and  material  properties  on  the  critical  stress  states  for  wrinkling  and  necking  can  be 
clarified.  Furthermore,  when  the  wrinkling  is  confined  to  a  local  region  of  the  sheet  metal  and 
such  a  region  can  be  isolated  from  the  remainder  of  the  sheet  metal,  the  curvature  and  the 
thickness-dependent  forming  limit  diagram  (FLD) }  1 05 J  and  wrinkling  limit  diagram  (WLD) 
[106,107]  may  be  established.  WLD,  FLD  and  information  from  the  sheet  forming  codes 
based  on  the  membrane  theory  will  provide  information  concerning  the  onset  of  wrinkling 
and  necking. 


Most  of  the  problems  associated  with  sheet  metal  forming  with  a  tool  fail  into  the  category 
of  unilateral  contact  [108,109],  which  is  not  discussed  here. 

5.4  Flow  Localization  of  Thermo-Elasto- Viscoplastic  Solids 

Real  strain-rate-dependent  flow  localization  manifests  itself  as  different  features  depending 
on  the  rate  of  deformation,  w  hereas  it  is  essentially  the  same  under  the  isothermal  condition 
[110,111],  meaning  that  thermocoupied  analysis  is  inevitable.  Plane  strain  tension 
[29,30,55,60,61,112],  compression  [26],  and  plane  strain  compression  near  a  stress-free 
surface  [27]  have  been  analyzed  without  the  inertial  effect  by  assuming  an  adiabatic  process 
which  represents  the  upper  bound  of  the  temperature  [29,1 12],  and  by  accounting  for  heat 
conduction  [26,27,55,60,61].  Plane  strain  quadrilateral  elements  with  hourglass  control 
[29,113]  and  crossed  triangular  elements  [26,27,55,60,61]  arc  employed.  An  intensification  of 
shear  localization  has  been  observed  for  larger  specimens,  and  the  adiabatic  assumption  may 
provide  suitable  information  only  in  the  specific  order  of  the  strain  rate  which  increases  as  the 
specimen  size  decreases.  Localization  of  the  deformation  is  delayed  by  the  strain  rate  effect 
and  strain  gradient  dependence  of  the  yield  stress,  as  seen  in  Eq.(4.1),  while  increase  in  the 
curvature  of  the  flow  potential  surface,  heat  generation  and  noncoaxiality  of  the  strain  rate  to 
deviatoric  stress  have  the  opposite  effect.  The  results  also  illustrate  that  the  effect  of  the  inter¬ 
action  of  material  properties  and  thermal  softening  and  growth  of  voids  arc  two  competing 
and  interacting  softening  mechanisms  in  porous  material. 
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Figure  5.7  Growth  of  surface  undulation  (left)  and  representative  viscoplastic  strain  distribu¬ 
tion  (right)  for  dynamic  compression  of  blocks  near  stress  free  surface  (from  [  1 16]). 

Needleman  [114]  has  performed  dynamic  flow  localization  analyses  for  plane  strain  com¬ 
pression  with  the  strain  softening  constitutive  equation  as  a  simple  model  for  a  thermally 
softening  solid.  This  equation  phenomenologically  describes  the  various  microscopic  soften¬ 
ing  mechanisms  including  damage  due  to  voids  and  microcrack  development,  and  texture 
softening.  As  a  result,  the  problem  can  be  treated  from  a  purely  mechanical  point  of  view. 
Except  for  a  significant  delay  of  shear  band  development  due  to  the  inertial  effect,  the  main 
features  of  shear  band  development  are  the  same  as  under  quasi-static  loading  conditions. 
Batra  and  Liu  [115]  investigated  a  similar  problem  by  introducing  a  temperature  bump  at  the 
center  of  the  block  showing  thermally  softening  viscoplastic  response. 

Figure  5.7  shows  the  compression  of  the  block  near  the  stress-free  surface  [116].  Deforma¬ 
tion  process  with  heat  conduction  (CO)  and  locally  an  adiabatic  condition  (AD)  have  been 
assumed.  A  competing  effect  of  thermal  softening  and  inertia  is  observed  in  the  evolution  of 
undulation  and  representative  strain.  According  to  the  experimental  observation,  flow  stress 
increases  with  abrupt  increase  in  strain  rate  [117,118].  The  effect  of  this  material  strain  rate 
history  dependence  on  flow  localization  behavior  has  been  investigated  [116].  Figure  5.8 


depicts  the  results  of  dynamic  compression  of  plane  strain  blocks.  Differences  observed  in 
representative  strain  distribution  (c)  for  11  and  111  arc  attributed  to  the  strain  rate  history 
dependence  of  the  material,  which  certainly  stabilizes  the  deformation,  furthermore,  the 
comparison  between  two  results  for  1  and  11  clarifies  that  the  dynamic  deformation 
subsequently  applied  to  the  quasi-static  deformation  stabilizes  the  deformation.  Meanwhile, 
the  effect  of  strain  rate  sensitivity  exponent  m  on  the  flow  localization  behavior  is  quite 
substantial  (29,1 16].  In  Nemat-Nasscr  et  al.  |29|,  m  was  switched  to  unity  at  a  certain  strain 
rate  so  that  the  bands  were  broader  and  tended  to  disappear  at  very  high  strain  rale.  For 
dipolar  materials,  the  localization  of  deformation  is  significantly  delayed,  and  localized 
deformation  zones  are  wider  as  compared  with  those  for  nonpolar  materials  | !  I9J. 

The  three-dimensional  aspects  of  localized  deformation  without  inertial  effect  ( 120]  and  with 
inertial  effect  (69,113]  have  been  investigated.  Zbib  and  Jubran  (113]  clarified  the  smooth 
transition  of  plane  stress  to  plane  strain  deformation  by  employing  very  thin  to  thick  speci¬ 
mens.  The  orientations  of  the  shear  band  are  35.25  and  45  degrees,  and  they  are  consistent 
with  theoretical  predictions.  Again,  a  softening  mechanism  and  an  initial  imperfection  are  the 
main  causes  of  shear  banding.  The  multiaxial  effect  stabilizes  the  deformation  and  causes 
delay  of  the  localization  j  1 13,120], 


(c) 


Figure  5.8  Dynamic  compression  of  blocks  under  plane  strain  condition,  (a)  Computational 
model,  W/L=l,  (b)  Boundary  condition  applied  at  both  ends,  Uj/L^O. 002/s,  u,/L=10J/s, 
T  =  10'7s,  I:  constant  displacement  rate  (u=u2),  II:  displacement  rate  jump  (u=u ,  lo  u=u,  at 
t=T;,  corresponding  end  displacement  u/L=0.05)  without  strain  rale  history  dependence,  Ilf:  II 
with  strain  rate  history  dependence,  (c)  Representative  strain  distribution  al  u/L=0. 125  (from 
[116]). 

5.5  Flow  Localization  of  Mono-  and  Polycrystalline  Solids 

Physically  observed  features  of  How  localizations  induced  by  the  evolution  of  microstrueturc 
and  their  interaction  are  quite  complicated,  and  hence  the  limitations  of  the  phenomenologi¬ 
cal  approaches  may  be  undeniable.  In  such  a  situation,  an  essential  procedure  is  the  direct 
physical  approach. 

In  the  crystalline  model,  constitutive  equations  originated  by  Asaro  and  Rice  |23|  and  Asaro 
(24)  have  been  extensively  employed.  Peirce  el  al.  1 121  ]  carried  out  a  finite  element  analysis 
of  macroscopically  localized  deformation  in  single  crystals  deformed  by  tension  bv  employ¬ 
ing  the  two-dimensional  double -slip  model  of  Asaro  ]  2-4  ] .  The  results  clarified  the  geometrical 
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softening  due  to  lattice  rotation,  anti  the  effect  of  self-  and  latent  hardening  on  shear 
localization,  which  are  in  close  agreement  with  experiments.  When  the  single  crystals  deform 
under  a  high  strain  rate,  thermal  softening  coupling  with  geometrical  softening  causes 
substantial  acceleration  in  shear  band  initiation  [122).  Kitagawa  and  Matsushita  [123,124) 
clarified  that  when  the  number  of  slipdirections  increases,  the  shear  banding  behavior 
becomes  close  to  that  for  J,  deformation  theory'. 

Figure  5.9(a)  shows  mode  1  crack  tip  displacement  in  a  silicon  iron  plate  deformed  under 
plane  strain  conditions  observed  experimentally  [125],  and  (b),(c)  depict  the  displacement 
vector  and  contour  of  representative  strain  calculated  using  a  single-crystalline  model,  res¬ 
pectively  [126].  Although  the  material  properties,  crack  shape  and  loading  conditions  are  not 
always  the  same,  good  correspondence  can  be  found  concerning  the  formation  of  the  sharp 
corner  at  the  crack  tip  and  the  manner  of  spread  of  the  deformation  field,  which  is  limited  to  a 
narrow  band  emanating  from  the  tip  in  a  radial  direction. 


Figure  5.9  Deformation  behavior  of  silicon  iron  cracked  plate,  (a)  Crack  tip  deformation  and 
displacement  observed  experimentally  (from  [125]),  (b)  Displacement  vector,  (c)  Representa¬ 
tive  strain  distribution  (from  [126]). 

In  order  to  understand  the  role  of  grain  boundaries  on  the  localization,  Lemonds  et  ai.  [  127] 
investigated  bicrystals  which  have  the  same  geometry  but  have  different  levels  of  latent 
hardening  and  grain  boundaries  across  which  crystallographic  misorientation  exists.  It  was 
found  that  the  boundaries  cause  a  nonuniform  deformation  from  the  onset  of  plastic  flow,  and 
the  formation  of  shear  bands  is  strongly  affected  by  the  initial  misorientation  across  the 
boundary  and  by  the  difference  in  latent  hardening. 

The  primitive  polycrystalline  model  has  been  constructed  as  a  heterogeneous  aggregation  of 
the  single  crystal  with  different  slip  systems  [123,124].  To  account  for  the  inhomogeneous 
deformation  within  the  single-crystal  grain,  the  grain  is  divided  into  several  finite  elements 
whose  constitutive  response  is  expressed  by  a  single-crystal  model  such  as  Asaro's  model 
[24]  with  the  same  slip  direction.  The  grain  boundaries  arc  the  same  in  bicrystals.  The  com¬ 
patibility  of  the  deformation  and  equilibrium  are  satisfied.  The  model  provides  insight  as  to 
how  the  flow  localization  depends  on  the  grain  size,  grain  profile  and  the  orientation  of  the 
slip  system.  The  evolution  of  representative  plastic  strain  in  Fig. 5. 10  captures  the 
characteristic  feature  of  the  flow  localization  in  tension  blocks.  That  is,  microscopically,  a 
heterogeneous  structure  of  the  material  microscopically  produces  an  inhomogeneous 
deformation  field,  and  then  surface  roughness  increases  steadily  with  increase  of  macroscopic 
strain.  Irregularity  of  deformation  develops  more  rapidly  near  the  free  surface  and  the  central 
part  of  the  body.  These  deformation  fields  cause  the  macroscopic  shear  bands  seen  in  the 


figure.  As  the  number  of  grains  along  the  thickness  decreases,  the  deformation  behavior 
becomes  more  dependent  on  such  initial  grain  characteristics  as  the  shape,  size  and  orientation 
of  the  slip  plane  [128]. 

Harren  and  Asaro  [129)  investigated  nonuniform  deformation  behaviors  in  polycrystals 
using  similar  but  more  elaborate  models  than  these  mentioned  above.  Two-dimensional 
polycrystals  are  the  aggregate  of  hexagonal  single  crystals  with  three  slip  directions.  A  patchy 
nonuniform  deformation  mode  associated  with  the  substantial  jumps  in  current  lattice 
orientation,  i.e.,  the  subgrain  boundary,  occurred.  It  was  also  clarified,  for  example,  that  the 
grain  boundaries  act  as  an  impedance  to  the  slip  activity,  while  the  triple  points  act  as 
initiators  of  nonuniform  deformation  modes.  How  shear  bands  cross  the  gram  boundary  was 
elucidated,  and  the  validity  of  Taylor's  model  was  also  precisely  examined.  However,  again, 
proper  modeling  of  the  grain  boundary  and  high  resolution  is  required  for  the  prediction  of 
initial  grain  level  shear  banding  [129]. 


Figure  5.10  Load  vs  elongation  and  evolution  of  representative  plastic  strain  in  polycrystalline 
block  under  tension  (from  [123,  124]). 

Solution  of  these  problems  can  be  expected  with  the  recent  development  of  mechanical 
analysis  employing  molecular  dynamic  (MD)  simulation,  which  is  the  most  elemental 
method  and  has  potential  for  treatment  of  the  dynamics  of  the  crystal  defect,  such  as  vacancy, 
dislocation  and  twinning.  Although  the  large  discrepancy  in  lime  and  space  in  the  continuum 
analysis  and  MD  analysis  is  inevitable,  further  development  is  expected,  and  indeed,  quite 
intensive  studies  are  now  under  way  to  bridge  the  micro-  and  macroscopic  deformation 
behaviors  [130], 


6.  CONCLUDING  REMARKS 

An  introduction  has  been  given  to  the  computational  prediction  of  plastic  instabilities.  The 
computational  strategies  to  predict  the  onset  of  bifurcation  for  general  solids  and  thin-walled 
bodies,  and  to  mesh  independent  predictions  of  flow  localization  behavior  were  discussed. 
Several  topics,  mainly  selected  from  the  contributions  during  the  last  decade,  including 
instability  behaviors  of  thick  circular  lubes  under  various  kinds  of  loading,  instability 
propagation  in  thermo-elasto-viscoplastic  polymeric  materials,  wrinkling  of  thin  walled 
bodies  under  different  type  of  metal  forming  processes,  dynamic  flow  localization  of  thermo- 
elasto-viscoplastic  blocks  and  flow  localization  of  mono-  and  polycrystalline  solids  were 
reviewed. 

Thus,  computational  studies  can  play  an  important  role  in  understanding  and  modeling  the 
wide  range  of  instability  behavior.  The  problem,  however,  associated  with  flow  localization 
still  to  be  solved  is  the  precise  prediction  of  the  chain  of  events  observed  in  deforming 
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material,  that  is,  the  initial  stage  of  deformation  which  includes  the  microscopically 
inhomogeneous  deformation  leading  to  instability  to  tl.e  final  stage  of  macroscopic  failure. 
To  clarify  the  continuous  transition  from  micro-  to  macroscopic  deformation  behavior,  the 
development  of  direct  physical  approaches  for  each  class  and  bridging  of  the  substantial  gap 
in  size  and  time  between  the  different  classes  are  essential.  The  suggestive  report  related  to 
this  is  given  in  ref.  131. 
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Abstract 

Some  of  the  roles  of  breaking  in  air-sea  interaction  are  reviewed.  At  present  we  have 
no  rational  way  of  theoretically  predicting  the  evolution  of  an  air- water  interface  beyond 
breaking,  so  progress  depends  to  a  great  extent  on  experimental  measurements  in  both 
the  laboratory  and  the  field.  Even  the  task  of  identifying  breaking  in  a  random  wave  field 
is  not  simple  and  much  effort  has  been  expended  in  determining  rational  breaking  criteria 
and  their  measurement.  As  a  consequence  of  breaking,  much  of  the  local  momentum  flux 
transmitted  from  the  atmosphere  to  the  ocean  via  the  wave  field  is  rapidly  transferred 
to  currents.  Breaking  is  the  primary  dissipative  mechanism  for  the  surface  wave  field, 
making  energy  available  for  mixing  the  surface  layers.  Local  equilibrium  wave  models,  in 
which  wind  input,  nonlinear  transfers,  and  dissipation  due  to  breaking  are  in  balance,  lead 
to  predictions  of  the  wave  spectrum  and  to  estimates  of  the  dissipation.  It  is  shown  that 
recent  field  measurements  of  enhanced  dissipation  in  the  surface  layers  are  consistent  with 
equilibrium  modelling  and  laboratory  measurements  of  dissipation  and  mixing.  Enhanced 
levels  of  surface  turbulence  and  bubble  mediated  gas  transfer  can  lead  to  significant  in¬ 
creases  in  gas  transfer  across  the  surface  with  the  onset  of  breaking.  Breaking  is  one  of 
the  major  sources  of  ambient  sound  in  the  ocean.  Recent  results  on  the  use  of  ambient 
sound  to  identify  and  quantify  breaking  are  presented. 

1.  Introduction 

The  study  of  surface  waves  has  a  long  tradition  in  theoretical  and  applied  mechanics, 
but  even  before  the  formal  investigation  of  the  subject  it  must  have  been  of  interest  for  as 
long  as  man  has  been  involved  in  maritime  exploration  and  trade.  The  dangers  presented 
to  early  explorers  by  wind  and  waves  are  readily  apparent  when  it  is  observed  that  many 
were  putting  to  sea  in  ships  which  were  barely  longer  than  many  of  today’s  cruising 
yachts.  Even  today,  fishing  vessels  capsize  and  are  lost  in  steep  and  breaking  seas.  While 
steep  and  breaking  seas  provide  the  extreme  environments  that  are  encountered,  it  is  still 
surprisingly  the  case  that  most  descriptions  of  the  ocean  surface,  whether  for  engineering 
or  oceanographic  purposes,  are  based  on  linear  theory.  The  foundations  of  linear  wave 
theory  were  laid  by  many  of  the  founders  of  theoretical  hydrodynamics  in  the  nineteenth 
century.  In  1847  Stokes  published  his  investigation  of  weak  nonlinear  effects  on  surface 
waves,  but  it  took  more  than  a  century  for  the  subject  to  extend  to  an  investigation 
of  nonlinear  interactions  between  surface  waves  (Phillips,  1960)  and  subsequently  the 
stability  of  weakly  nonlinear  waves  (Benjamin  h  Feir,  1967). 
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Convenient  measures  of  the  nonlinearity  are  the  wave  steepness  ak ,  where  a  is  the  wave 
amplitude  and  k  is  the  wave  number,  or  equivalently,  ujc  where  u  is  a  measure  of  the 
horizontal  fluid  velocity  at  the  surface  and  c  is  the  phase  speed.  In  breaking  waves  u/c 
may  exceed  unity  while  ak  is  0(1),  and  so  breaking  waves  are  strongly  nonlinear.  Rigorous 
theoretical  investigations  of  breaking  have  been  confined  to  describing  local  features  of 
the  flow  (Longuet-Higgins,  1988).  Numerical  codes  are  available  to  describe  the  evolution 
of  two-dimensional  surface  waves  up  to  breaking,  but  they  are  unable  to  continue  beyond 
the  point  at  which  the  surface  impacts  on  itself  (Dommermuth  et  al.,  1987).  Thus  the 
study  of  wave  breaking  is  based  in  large  part  on  experiments  in  both  the  laboratory  and 
the  field. 

The  most  common  expression  of  breaking  at  the  sea  surface  is  a  vvhitecap  due  to  the 
entrained  air.1  Whitecaps  become  obvious  at  scales  of  0(lm)  extending  up  to  0(1 00m), 
but  breaking  may  occur  without  significant  air  entrainment  down  to  scales  of  centimeters. 
This  lack  of  a  universal  visual  feature  identifying  breaking  and  the  difficulty  of  measuring 
variables  in  a  tu'o-phase  free-surface  turbulent  flow  have  made  progress  difficult,  especially 
in  the  field.  However,  this  difficulty  is  justified  by  the  importance  of  breaking  to  the 
processes  occurring  at  the  air-sea  interface.  Breaking  plays  a  number  of  related  roles  in 
air-sea  interaction.  These  include: 

a.  Limiting  the  height  (slope)  of  surface  waves. 

b.  Being  a  source  of  vorticity  and  turbulence. 

c.  Dissipating  surface  wave  energy,  some  of  which  is  available  for  turbulent  mix¬ 
ing. 

d.  Generating  ocean  currents  by  transferring  momentum  from  the  wave  field. 

e.  Enhancing  gas  transfer  due  to  surface  turbulence  and  bubble  entrainment. 

f.  Generating  sound  at  the  ocean  surface  which  may  be  used  as  a  diagnostic  tool 
for  air-sea  interaction  studies  (Acoustical  Oceanography). 

g.  Providing  passive  and  active  microwave  signatures  which  may  prove  useful  in 
remote  sensing  of  the  ocean. 

It  may  seem  surprising  that  a  process  which  is  intermittent  in  space  and  time,  and 
which,  at  least  at  the  larger  scales,  is  only  occurring  over  a  small  fraction  of  the  ocean 
surface  at  any  time,  may  be  significant.  We  shall  see  below  that  the  direct  effects  of 
breaking  are  typically  felt  to  a  depth  of  the  order  of  the  wave  height.  In  relatively  benign 
conditions  this  may  be  to  depths  of  the  order  of  metres,  but.  in  stormy  seas  this  may  be 
depths  of  O(10m).  The  sensitivity  of  the  atmosphere-ocean  system  to  the  conditions  in 
the  first  ten  metres  of  the  ocean  are  graphically  presented  by  Gill(  1982): 

a.  Of  the  solar  radiation  incident  on  the  atmosphere  40%  is  absorbed  in  the  first 
10 m  of  the  ocean;  more  in  coastal  waters. 

’For  the  proceedings  of  a  recent  conference  on  the  subject  of  whitecaps,  see  Monahan  &  McNiocaill,  1986. 
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b.  The  first  10m  of  the  water  column  has  the  same  weight  as  all  the  atmosphere 
above. (  This  has  important  implications  for  gas  transfer  by  bubbles.) 

c.  The  first  2.5m  of  the  water  column  has  the  same  heat  capacity  as  all  the 
atmosphere  above. 

Now  breaking  is  observed  to  begin  at  a  wind  speed  of  approximately  3m/s.  This,  together 
with  the  fact  that  the  momentum  and  mechanical  energy  fluxes  across  the  air-sea  interface 
typically  scale  as  the  square  and  cube  of  the  wind  speed,  respectively,  implies  that  even 
relatively  short  periods  of  high  winds  and  waves  may  contribute  significant  fractions  of 
the  total  fluxes  across  the  air-sea  interface,  with  breaking  absent  only  under  the  most 
benign  conditions. 

In  this  paper  we  shall  review  some  of  the  recent  contributions  to  our  knowledge  of 
breaking  and  its  role  in  air- sea  interaction.  This  is  not  meant  to  be  an  exhaustive  review 
of  the  subject,  a  task  which  would  expand  well  beyond  the  format  of  these  proceedings. 
However,  it  is  hoped  that  the  reader  will  achieve  an  appreciation  for  the  wide  range  of 
techniques,  both  theoretical  and  experimental,  that  are  being  used  to  investigate  this 
important  phenomenon. 

2.  Breaking  Criteria  and  Detection 

Historically,  the  first  "breaking”  flow  to  be  identified  was  Stokes'  120°  corner  flow  at  the 
crest  of  a  periodic  wave  of  limiting  slope  ( ak  =  0.443).  In  a  frame  of  reference  travelling 
at  the  phctse  speed  the  fluid  at  the  crest  is  at  rest,  and  so  in  the  laboratory  frame  it  is 
moving  at  the  phase  speed  of  the  wave.  This  then  satisfies  the  kinematic  criterion  that 
the  fluid  velocity  matches  or  exceeds  the  phase  velocity.  However,  it  is  most  unlikely 
that  such  a  flow  can  ever  be  realized,  since  uniform  wave  fields  of  much  smaller  slope  are 
subject  to  intrinsic  instabilities  which  may  lead  to  breaking.  Melville  (1982)  has  shown 
experimentally  that  for  wave  slopes  below  approximately  0.3  two-dimensional  waves  are 
unstable  to  two-dimensional  Benjamin-Feir  instabilities  which  lead  to  breaking,  while  at 
larger  slopes  three-dimensional  instabilities  dominate  (  See  also  Su  et  ah,  1982;  McLean 
et  ah,  1981).  Numerical  solutions  hav<  shown  that  uniform  wave  trains  with  slopes  as 
small  as  0.1  may  evolve  to  breaking  following  instability  (Dold  &  Peregrine,  1986). 

Stokes’  limiting  form  gives  an  expression  for  the  wave  height,//,  in  terms  of  the  wave 
period, T,  and  the  gravitational  acceleration,  g : 

f/  =  0.027gT2.  (1) 

Laboratory  experiments  by  a  number  of  authors  (  Figure  1)  have  shown  that  breaking 
occurs  for  average  values  of  the  numerical  factor  in  the  range  0.016-0.022,  or  at  slopes 
based  on  the  wave  height  and  length  of  approximately  two-thirds  that  of  the  limiting 
slope  (  Coincidentally,  this  is  close  to  the  slope  of  0.3  at  which  fast  three-dimensional 
instabilities  begin.)  However,  as  Figure  1  shows,  the  scatter  in  the  data  is  considerable. 

F  ield  observations  of  breaking  have  been  based  on  a  so-called  ’’jump  meter”  which  uses 
the  time  derivative  of  the  surface  displacement  measured  by  a  wave  gauge  along  with 
the  characteristic  phase  speed  of  the  wave  to  determine  spatial  gradients  in  the  surface 
displacement  (Longuet- Higgins  &  Smith,  1983;  Thorpe  &  Humphries,  1980).  Gradients 
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Figure  1.  Laboratory  measure¬ 
ments  of  wave  heights  and  periods  at 
breaking  compared  with  Stokes  lim¬ 
iting  form  (Bonmarin,  1989). 


Figure  2.  Laboratory  measurements  of  nor¬ 
malised  surface  displacement  and  horizon¬ 
tal  fluid  velocity  in  breaking  and  unbroken 
waves  (Melville  &  Rapp,  1988). 


which  exceed  an  arbitrary  threshold  are  considered  to  be  breaking.  Weissman  et  al.(1984) 
have  used  high-passed  wave  gauge  records  to  detect  breaking  in  lake  experiments.  Since 
large  temporal  gradients  are  resolved  by  high  frequency  Fourier  components,  the  jump- 
meter  and  high-frequency  approaches  are  related.  Holthuijsen  &  Herbers(1986)  visually 
observed  whitecaps  at  the  location  of  a  wave  measuring  buoy  and  analysed  their  data  in 
terms  of  the  joint  height-period  probability  distributions  of  both  the  broken  and  unbroken 
waves  passing  the  buoy.  These  distributions  showed  considerable  overlap  and  no  clear 
threshold  for  breaking  based  on  the  combined  wave  height  and  period.  This  contrasts 
with  the  laboratory  measurements  described  above.  However,  the  difficulty  with  point 
measurements,  whether  in  the  laboratory  or  field,  is  that  in  a  random  breaking  wave 
field  they  may  catch  the  wave  at  any  stage  in  the  breaking  process,  and  since  the  wave 
parameters  may  change  during  that  time  they  are  not  necessarily  the  same  as  those  at 
the  instant  of  the  onset  of  breaking. 

This  point  was  made  very  clear  by  a  laboratory  study  by  Melville  &  Rapp(1988)  who 
used  a  laser  anemometer  to  measure  the  horizontal  velocity  of  the  fluid  at  the  surface 
to  detect  breaking.  Figure  3  shows  an  example  of  a  small  section  of  coincident  surface 
displacement  and  velocity  time  series.  Of  the  two  waves  in  the  frame  the  smaller  less  steep 
wave  is  breaking  while  the  larger  steeper  wave  is  not.  The  wave  began  breaking  near  the 
peak  of  the  wave  envelope  but  decreased  in  amplitude  and  slope  as  it  continued  to  break. 
This  example  displays  the  danger  of  using  observations  of  breaking  at  any  stage  in  its 
evolution  to  infer  breaking  criteria.  It  also  points  to  the  difficulties  inherent  in  using  the 


surface  displacement  or  its  first  derivative  in  time  as  an  indicator  of  breaking. 

In  recent  years  other  non-hydrodynamic  methods  of  detecting  breaking  have  been  in¬ 
vestigated  and  exploited.  These  include  acoustic  methods  (about  which  we  will  have 
more  to  say  below),  and  microwave  methods.  The  use  of  active  microwave  methods  to 
detect  and  quantify  breaking  has  been  investigated  by  a  number  of  authors.  Keller  et 
al.  (1986)  presented  an  illustrative  summary  of  the  use  of  coherent  microwave  systems 
in  measuring  breaking  waves.  They  pointed  out  that  both  specular,  Bragg  and  volume 
scattering  may  contribute  to  the  scattered  components.  Banner  Sz  Fooks(  1985)  associated 
enhanced  scattering  by  a  quasi-steady  spilling  wave  wBh  the  Bragg  components  in  the 
smaller-scale  waves  and  surface  disturbances  that  accompany  the  turbulence.  The  author 
and  his  co-workers  (Melville  et  ah,  1988:  Jessup  et  al,  1990,  1991a, b;  Loewen  Melville, 
1991)  have  attempted  to  quantify  the  scattering  by  breaking  waves  and  correlate  it  with 
the  dynamics  of  the  breaking  waves.  In  laboratory  studies  we  have  found  that  the  scat¬ 
tering  correlates  with  the  dissipation  due  to  breaking,  although  the  detailed  reason  for 
this  result  remains  to  be  investigated.  In  the  field  we  have  found  that  the  lower  order 
moments  of  the  microwave  Doppler  spectrum  taken  together  provide  a  good  indicator 
of  breaking  and  that  the  contribution  of  the  breaking  waves  to  the  radar  cross  section 
at  moderate  incidence  angles  is  significant  and  increases  approximately  as  the  cube  of 
the  friction  velocity  of  the  wind.  Notwithstanding  these  successes,  the  fact  remains  that 
steep  but  unbroken  waves  may  have  similar  microwave  signatures  to  breaking  waves,  and 
resolving  the  differences  may  prove  difficult. 

In  summary,  the  only  reliable  methods  for  identifying  breaking  at  all  scales  include 
measurements  of  both  the  surface  geometry  and  the  surface  velocity  field.  Such  methods 
are  confined  to  the  laboratory  at  present.  The  problem  of  detection  and  quantification 
of  the  parameters  describing  breaking  waves  remains  an  impediment  to  field  studies  of 
breaking. 

3.  Momentum  Flux  across  the  Air-Sea  Interface 

The  wind  generates  waves!  The  wind  generates  currents!  Even  to  the  lay  observer 
these  statements  are  obviously  true.  Now  waves  can  transport  momentum  with  a  mean 
momentum  flux  that  is  proportional  to  the  square  of  the  wave  height.  What  is  not  so 
obvious  is  the  partitioning  of  the  fluxes  across  the  air-sea  interface  between  waves  and 
currents.  If  the  waves  were  to  continuously  receive  momentum  from  the  wind  they  would 
continue  to  grow  and  the  momentum  flux  they  carry  would  continue  to  increase.  The 
growth  of  the  waves  is  arrested  by  breaking  and  in  breaking  the  wave  field  gives  up  part 
of  its  momentum  flux  to  currents.  Thus  breaking  is  an  important  process  in  the  generation 
of  ocean  currents. 

These  issues  were  addressed  by  Mitsuyasu  (1985).  The  essentials  may  be  discussed  by 
considering  a  locally  uniform  two-dimensional  wave  field  which  has  spatial  gradients  on  a 
scale  much  larger  than  the  wave  length.  The  wave  momentum  flux,  Sn,  is  given  by 


where  E  —  ^pwga2  is  the  wave  energy  density.  The  momentum  flux  from  the  air  to  the 
water  r0  is  given  by  r0  =  rw  +  r(,  where  rw  is  the  flux  directly  into  the  waves,  and  Tt  is 
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Figure  3.  Laboratory  measurements  of  normalized  variance  of  the  free  surface  dis¬ 
placement  of  a  wave  packet  as  a  function  of  downstream  position.  Note  the  de¬ 
crease  across  the  breaking  region.  Symbols  refer  to  values  of  a  slope  parameter  a0ko  : 
+,0.26;  A, 0.30;  0,0.39.  (Melville  &  Rapp,  1986). 


the  remaining  turbulent  flux  which  is  not  coherent  with  the  wave  field.  Now  tu,  can  be 
related  to  the  gradient  in  the  momentum  flux, 


(3) 


where  0  —  ^ [s  the  energy  density  growth  rate,  and  c  and  cg  are  the  phase  and  group 
velocities,  respectively.  From  laboratory  experiments  in  the  absence  of  breaking  Mitsuyasu 
found  that  tw  =  22(ak)2r0 ,  giving  significant  values  of  rw  for  representative  values  of  the 
wave  slope  ak.  This  estimate  of  the  momentum  flux  directly  into  the  wave  field  was 
supported  by  laboratory  and  field  measurements  giving  ratios  of  ^  in  the  range  0.4  —  0.6 
(Hsu  et  al.,  1981,  1982;  Snyder  et  al.,  1981).  However,  field  measurements  of  the  secular 
wave  growth  showed  that  =  0.05to;  that  is,  only  5%  of  the  momentum  flux  from  the 
atmosphere  is  carried  by  waves  propagating  out  of  the  generation  region.  This  is  only 
10%  of  the  measured  momentum  flux  from  the  wind  to  the  waves.  Mitsuyasu  conjectured 
that  the  remainder  of  tw  is  lost  from  the  wave  field  by  breaking  in  the  generation  region. 

This  argument  was  supported  by  laboratory  experiments  on  unsteady  breaking  by 
Melville  &  Rapp(1986).  Using  the  dispersive  properties  of  deep-water  waves  to  focus 
a  wave  packet  in  a  laboratory  channel,  and  by  measuring  the  wave  field  upstream  and 
downstream  of  the  break,  we  were  able  to  measure  the  excess  momentum  flux  lost  from 
the  wave  field  (Fig.  3).  Using  field  measurements  of  the  incidence  of  breaking  by  Thorpe 
&:  Humphries  (1980),  we  were  able  to  show  that  Mitsuyasu's  conjecture  was  supported 
by  the  combined  laboratory  and  field  data.  The  laboratory  studies  were  considerably 
extended  by  Rapp  &;  Melville(1990). 


We  can  conclude  from  this  work  that  while  a  significant  fraction  of  the  momentum  flux 
from  the  atmosphere  (50%  or  so)  may  pass  through  the  wave  field;  most  of  it  is  lost  to 
currents  by  breaking  in  the  generation  region. 


4.  Wind-Wave  Modelling 


4.1.  The  radiative  transfer  equation 

While  a  rigorous  theoretical  foundation  for  prediction  of  wind-wave  generation  evolution 
and  decay  does  not  yet  exist,  a  sufficient  number  of  the  important  pieces  of  the  puzzle 
can  be  assembled  to  formulate  the  problem  in  terms  of  the  radiative  transfer  equation 
(Phillips,  1977) 


dN  dN 

~dt  =  +  ^  +  U)-VN  =  - IK 


where  N{k)  =  ^F(k)  —  [y / k)1^2 F{k)  is  the  wave  action  spectral  density;  T(k)  is  the 
action  spectral  flux  due  to  wave-wave  interactions:  $u,  is  the  wind  input,  and  D  represents 


the  dissipation  which  is  thought  to  be  due  primarily  to  breaking.  I  =  —  S7.7'(k)  is  the 
’’collision  integral”  representing  the  nonlinear  wave-wave  interactions  as  formulated  by 
Hasselmann  (1962).  In  operational  wave  models  I  is  computed  numerically.  The  wind 
input  term  is  based  on  Miles  theory  (1957,  1962),  and  empirical  input  (Snyder  et  ah, 
1981;  Plant,  1982)  leading  to  an  expression  of  the  form 


Sw  —  m  cos2p  8a(u,a/ c)2  N (k) 


(6) 


(Phillips,  1985),  where  rn  and  p  are  empirical  constants  and  uma  is  the  friction  velocity  in 
the  air.  The  dissipation  term  is  the  least  well  understood.  Hasselmann  (1974)  has  made 
heuristic  arguments  subsequently  modified  by  Komen  et  al.(  1984)  leading  to  an  expression 
of  the  form 

D  oc  a(a/d)2(a/apM)2 N{k)  (6) 


where  d  is  a  slope  parameter  and  the  subscript  ’PM’  refers  to  a  value  representative  of 
the  Pierson- Moskowitz  spectrum  .  Figure  4  shows  an  example  of  the  spectrum  and  the 
contributions  from  the  various  terms  as  computed  by  Komen  et  al  (1984).  According  to 
this  model  the  wind  input  and  dissipation  predominate  at  frequencies  greater  than  the 
spectral  peak,  while  the  effect  of  the  nonlinear  interactions  is  to  add  energy  at  the  lower 
frequencies  while  subtracting  it  from  the  higher  frequencies.  Clearly,  if  this  class  of  models 
is  representative  of  the  physics,  then  the  dissipation  due  to  breaking  is  comparable  to  the 
wind  input  and  must  have  a  profound  effect  on  the  details  of  the  spectral  evolution. 


4.2.  Equilibrium  Models  and  Spectral  Slopes 

With  the  example  of  the  success  of  Kolmogorov’s  universal  equilibrium  and  inertial 
subrange  in  describing  turbulence  spectra  ,  it  is  not  surprising  that  similar  approaches 
have  been  made  in  an  attempt  to  explain  the  essential  characteristics  of  t  he  surface  wave 
energy  spectrum.  The  motivation  for  this  work  has  expanded  in  recent  years  with  the 
need  to  understand  active  microwave  remote  sensing  systems  which  are  directly  affected 
by  the  small  scale  waves  having  wavelengths  of  O(lm)  or  less. 
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Phillips  (1958)  speculated  that  at  high  frequencies  and  wavenumbers  saturation  due 
to  breaking  would  occur,  and  that  the  form  of  the  spectrum  could  only  depend  on  the 
frequency  a  and  gravity  g.  This  led  to  a  frequency  spectrum  of  the  form  F(u)  =  ag2a~b, 
with  a  becoming  known  as  ’Phillips  constant’.  The  deep-water  dispersion  relationship, 
c t 2  —  gk,  then  gives  a  wavenumber  spectrum  F{k )  oc  k~4.  During  the  1970’s,  as  more 
measurements  of  wave  spectra  became  available,  it  became  apparent  that  a  was  not 
constant.  O"  the  basis  of  laboratory  meaeurcments  and  dimensional  analysis  Toba  (19/3) 
proposed  that  F  should  also  depend  on  the  ratio  u.a/c  (where  u.a  is  the  friction  velocity 
in  the  air)  giving 

F{er)  oc  {u.a/c)g2< r-5  =  u.ngcr~A.  (7) 

This  result  was  supported  by  subsequent  laboratory  and  field  data  (Kawai  et  al.,  1977; 
Forestall,  1981;  Donelan  et  ah,  1985)  and  the  parameter  c/u»„  has  become  known  as 
the  ’’wave  age”.  Kitaigorodskii  (1983)  followed  Kolmogorov’s  inertial  subrange  idea  more 
explicitly  considering  an  equilibrium  region  in  which  l  —  Sw  —  D  —  0.  As  a  consequence 
of  this  hypothesis  he  found  that 

F(k)  «  el/3g-^k~y\  (8) 

F{a)  «  tj3gcr~4\  (9) 

this  frequency  spectrum  agreeing  with  the  more  recent  observations.  Kitaigorodskii’s 
model  was  quickly  followed  by  an  equilibrium  model  from  Phillips  (1985).  The  basis 
of  Phillips’  modelling  was  that  the  right  hand  side  of  Equation  4  was  zero  through  an 


equilibrium  in  which  all  terms  were  significant: 

Z  +  S^  +  Z)  =  0  (10) 

/  oc  Sw  oc  D.  (11) 

This  work  provides  the  most  readable  and  rational  account  of  this  class  of  wave  modelling. 
Phillips  found  that  the  energy  spectrum  was  given  by 

F(k)  =  (3{cos0)pu.g-'/2k-7/2  (12) 

and  the  spectral  rate  of  energy  loss  was  given  by 

e(k)  =  7  03(cosO)3pu,k~2.  (13) 


It  may  at  first  seem  surprising  that  the  very  different  hypotheses  of  Kitaigorodskii  and 
Phillips  should  give  the  same  spectral  slope  of  —7/2;  however,  as  pointed  out  by  Phillips, 
the  form  of  /,  which  is  the  same  in  both  cases,  the  assumption  of  proportionality  between 
the  three  terms,  and  dimensional  constraints,  determine  the  slope.  (This  work  also  pro¬ 
vides  a  general  more  reminder  that  the  prediction  of  a  spectral  slope  may  be  a  necessary 
but  not  sufficient  test  of  a  theory.) 

Most  recently,  Banner  (1990)  has  synthesized  a  spectral  model  based  on  observations. 
Writing  the  wavenumber  directional  spectrum  in  the  form 


F(k,0)  =  F(k,6max)D(0-k) 


(14) 
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where  F(k,6max)  oc  A:-4  and D(0\k)  are  based  on  observations;  he  calculates  a  frequency 
spectrum  proportional  to  u,acr-4. 

We  summarize  by  noting  that  our  inability  to  directly  measure  the  spectral  distribution 
of  dissipation  due  to  wavebreaking  is  the  greatest  impediment  to  the  further  development 
of  the  current  generation  of  wind-wave  models.  In  the  absence  of  these  measurements,  the 
local  equilibrium  hypothesis  provides  a  rational  way  of  estimating  the  dissipation  based 
on  our  better  knowledge  of  wind  input  and  weak  nonlinear  transfers.  It  is  also  consistent 
with  a  multiple-scale  approach  in  which  on  short  length  and  time  scales  the  spectrum  is 
in  equilibrium,  while  on  longer  scales  secular  changes  can  occur. 
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Figure  4.  Model  wave  spectrum  and  con-  Figure  5.  Field  measurements  of  enhanced 
tributions  of  the  wind  input,  —  •  -,  nonlin-  dissipation  in  the  surface  layer  (Agrawal  et 

ear  transfer,  •  •  •,  and  dissipation, - ,  in  ah,  1992). 

the  wind  direction  (Komen  et  ah,  1984). 
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5.  Dissipation  due  to  breaking 

In  the  absence  of  a  direct  method  for  testing  the  available  models  of  spectral  dissipation, 
we  can  at  least  determine  whether  they  are  consistent  with  the  available  observations.  We 
consider  Phillips’(  1985)  equilibrium  model  as  the  basis  for  such  a  comparison.  Phillips’ 
integrated  spectral  dissipation  rate  (over  the  equilibrium  range)  ,  c( k ),  gives  a  total  rate 
of  energy  dissipation,  e0)  where 

e0  «  (2j/33I(3p)~)paula  In (rCp1).  (15) 

Pa 

There  are  a  number  of  empirically  based  parameters  included  in  this  expression  which 
Phillips  estimated  to  be  given  by  the  following  approximate  values:  a  ss  6  —  11. 10-2; 
7/33/(3p)  >  4.10“4,  and  r  cs  0.4  —  0.7.  The  drag  coefficient,  Cp  is  taken  to  be  equal  to 
1.2  x  10~3.  With  these  estimates  we  find  that 

€0  >  5 .paula.  (16) 

Now,  laboratory  experiments  on  unsteady  breaking  by  Rapp  h  Melville(1990)  have 
shown  that  more  than  90%  of  the  energy  lost  from  the  wave  field  is  dissipated  in  the  water 
column  within  four  wave  periods  of  the  onset  of  breaking;  the  active  breaking  itself  lasting 
for  a  time  comparable  to  the  wave  period.  This  very  rapid  rate  of  dissipation  was  difficult 
to  reconcile  with  preconceptions  until  our  recent  work  (Lamarre  &  Melville,  1991)  showed 
that  up  to  50%  of  the  energy  lost  from  the  wave  field  is  initially  expended  in  overcoming 
the  buoyancy  forces  acting  on  the  entrained  air.  (See  below).  As  the  air  returns  to  the 
surface  it  acts  as  a  buoyant  source  of  kinetic  energy,  but  also  enhances  dissipation  due  to 
the  shear  in  the  bubble  boundary  layers.  The  experiments  of  Rapp  &  Melville  (1990)  also 
showed  that  the  turbulent  region  generated  by  the  breaking  wave  advected  and  mixed 
down  to  a  dimensionless  depth  kD  m  0.5  —  1,  where  k  is  the  characteristic  wavenumber 
of  the  breaking  waves.  This  suggests  that  the  bulk  of  the  dissipation  in  the  water  column 
due  to  breaking  will  take  place  in  a  layer  which  is  much  less  than  the  wave  length  ,  and  if 
breaking  occurs  for  ak  as  0.3  (c.f.  Figure  1),  then  the  thickness  of  this  highly  dissipative 
layer  would  be  in  the  range  1  —  1 1  wave  heights. 

We  can  be  a  little  more  specific  with  these  ideas  by  assuming  that  the  total  energy  is 
dissipated  over  a  depth  D  at  a  volumetric  rate  ew, 


[D 

Co  “  /  Pw^ujdz 

Jo 

—  Pw^-w^ 


(17) 

(18) 


which  defines  the  mean  dissipation  t~w  over  the  depth  D. 

In  a  logarithmic  boundary  layer  the  dissipation  rate  is  typically  scaled  with  u3/*z 
where  k  is  von  Karman’s  constant  (0.4).  This  scaling  gives  a  dimensionless  dissipation 
rate  of  unity.  It  is  of  interest  to  determine  whether  the  estimated  dissipation  rate  in  the 
water  column  under  breaking  waves  based  on  the  modelling  of  Phillips  (1985)  and  the 
measurements  of  Rapp  h  Melville  (1990)  gives  values  significantly  different  from  those  in 
the  classical  wall  layer.  If  we  use  the  mean  values  of  ew  and  D  to  scale  z,  we  have  that 
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It  remains  to  determine  the  friction  velocity  in  the  air,  u.a ,  in  terms  of  the  friction 
velocity  in  the  water,  u,w.  We  can  investigate  this  relationship  by  considering  the  dispo¬ 
sition  of  the  momentum  flux  lost  from  the  wave  field  by  breaking.  This  momentum  flux 
is  transferred  to  currents.  Given  the  intense  mixing  due  to  breaking  we  anticipate  that 
there  will  a  distinct  surface  current  within  a  depth  of  the  order  of  one  wave  height  from 
the  surface.  Since  we  are  assuming  that  the  wave  field  is  in  approximate  local  equilibrium, 
to  be  consistent  we  must  also  assume  that  the  horizontal  gradients  in  the  current  are  not 
large,  and  in  consequence  are  changing  on  the  same  slow  spatial  scales  as  the  wave  vari¬ 
ables.  This  implies  that  if  such  a  surface  current  exists  then  the  momentum  flux  through 
its  upper  boundary  (the  surface)  is  almost  matched  by  that  passing  through  its  lower 
surface  at  a  depth  comparable  to  the  wave  height.  Thus  we  expect  that  paula  m 
whence,  uraju.w  «  and  ewKzu~ >  60  =  0(10  -  102),sap. 

These  estimates  were  motivated  by  recent  measurements  of  Agrawal  et  al.(1992).  Work¬ 
ing  from  a  platform  in  Lake  Ontario  they  used  optical,  acoustical  and  electro-mechanical 
instruments  to  measure  the  dissipation  near  the  surface.  Their  principal  results  are  pre¬ 
sented  in  Figure  5,  which  shows  the  measured  dissipation  scaled  in  w'all  variables,  as  a 
function  of  depth  scaled  by  ulw/g.  The  figure  may  be  summarized  by  the  observation 
that  for  z  =  gzju\w  >  105  the  law  of  the  wall  scaling  appears  to  hold,  whereas  for  z  <  105 
the  dissipation  rate  is  up  to  two  orders  of  magnitude  greater.  How  does  a  value  of  z  =  10s 
compare  with  the  wave  height?  Agrawal  et  al.(1992)  did  not  give  complete  information 
on  the  supporting  data  for  their  measurements;  however,  they  did  indicate  that  the  fetch 
limited  waves  were  of  the  order  of  30cm  significant  wave  height  ( Hw ),  and  the  winds  were 
greater  than  8ms*1.  Taking  these  values  and  assuming  a  value  of  the  drag  coefficient  in 
the  air  of  Co  =  1.2  x  10~3,  the  scaled  significant  wave  height,  Hw  =  0.4  x  10'5.  Thus 
the  characteristics  of  the  measured  enhanced  dissipation  layer  are  in  agreement  with  the 
estimates  based  on  Phillips  (1985)  modelling  and  our  laboratory  measurements  of  un¬ 
steady  breaking  (Rapp  &  Melville,  1990).  While  Agrawal  et  al.(1992)  speculated  that  the 
source  of  the  high  dissipation  events  was  breaking  waves,  they  stated  that  independent 
(unpublished)  estimates  of  energy  fluxes  to  steep  waves  at  frequencies  greater  than  the 
peak  of  the  spectrum  were  consistent  with  the  enhanced  kinetic  energy  dissipation  they 
had  measured.  As  shown  above,  enhanced  dissipation  also  requires  that  the  energy  be 
dissipated  over  a  sufficiently  small  depth.  The  estimates  presented  here,  which  are  based 
on  both  measurements  and  modelling,  provide  support  for  the  conclusion  that  the  source 
of  the  enhanced  dissipation  is  breaking. 

6.  Gas  Transfer 

Public  debate  has  heightened  awareness  of  the  importance  of  the  oceans  in  modulating 
secular  changes  in  climate.  Perhaps  nothing  has  attracted  as  much  attention  as  the  influ¬ 
ence  of  the  increase  in  greenhouse  gases  on  global  warming.  Quantitative  estimates  of  the 
role  of  the  oceans  in  taking  up  increases  in  CO2  are  based  in  part  on  global  budgets  which 
involve  considerable  uncertainty.  As  Baggeroer  &  Munk  (1992)  have  recently  observed  in 
their  status  report  on  acoustic  monitoring  of  ocean  warming,  ”. .  .all  we  know  for  sure  is 
that  the  oceans  are  an  important  sink  of  heat,  and  CO 2,  and  of  ignorance.” 

For  moderately  soluble  gases  such  as  CO2  and  O2  the  transfer  across  the  air-sea  in- 
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Figure  6.  Field  measurements  of  dissolved  oxygen  and  other  environmental  variables, 
including  wave  height,  wave  orbital  velocity  and  wind  speed.  Note  the  rapid  increase  in 
the  dissolved  gas  with  storm  events  indicated  in  the  wave  orbital  velocity  time  series,  and 
the  slower  relaxation  back  to  lower  levels  (Wallace  &  Wirick,  1991). 


terface  is  believed  to  be  controlled  by  the  aqueous  boundary  layer  (Jaehne,  1990)  and  is 
parameterised  in  terms  of  a  gas  transfer  velocity  k  where  the  molar  flux  of  gas  across  the 
interface,  j  =  &Ac,  and  Ac  is  the  difference  between  the  saturation  and  bulk  concentra¬ 
tion  of  the  dissiolved  gas.  For  low  wind  speeds,  wind-wave  tunnel  studies  have  shown 
that 

k  »  0.155c-a5u,u,  (20) 

(Jaehne,  1990;  Khoo  &  Sonin,  1992),  where  Sc  =  vf k  is  the  Schmidt  number,  the  ratio 
of  the  kinematic  viscosity  of  the  fluid  to  the  diffusion  coefficient  of  the  dissolved  gas.  At 
higher  wind  speeds,  above  8  —  10ms_1,  there  is  a  break  point  in  the  linear  rise,  with  higher 
wind  speeds  giving  a  correspondingly  higher  transfer  velocity.  This  increase  in  the  transfer 
velocity  has  been  attributed  to  the  onset  of  breaking  and  bubble  entrainment,  but  recent 
experiments  by  Khoo  &:  Sonin  (1992)  in  which  the  turbulence  is  induced  by  agitation 
from  below  with  no  breaking  nor  air  entrainment  show  a  similar  break  and  enhancement 
of  the  gas  transfer.  Notwithstanding  this  uncertainty  regarding  the  mechanism  leading  to 
enhanced  gas  transfer  at  higher  wind  speeds,  Jaehne  (1990)  has  demonstrated  that  the 
addition  of  a  mechanically  generated  breaking  wave  to  a  wind-generated  wave  field  in  the 
laboratory  can  enhance  the  gas  transfer  by  a  factor  of  two. 

The  opportunites  for  field  studies  of  gas  transfer  have  improved  recently  with  the  avail¬ 
ability  of  relatively  rapid-  response  dissolved  gas  sensors  which  can  be  deployed  on  long 
term  moorings.  In  a  recent  field  experiment  in  the  Middle  Atlantic  Bight,  Wallace  &: 
Wirick  (1992)  deployed  a  pair  of  dissolved  oxygen  sensors  at  two  depths  (19m  and  34m) 
for  a  period  of  four  months.  They  found  that  the  time  series  of  dissolved  oxygen  were 
distinguished  by  sudden  large  increases  associated  with  surface  wave  activity,  followed 
by  longer  periods  of  degassing  between  storms  (Figure  6).  They  compared  their  mea¬ 
surements  with  semi-empirical  models  due  to  Thorpe  (1984)  and  Spitzer  &  Jenkins(1989) 
(each  including  bubble  transport)  and  a  thin-film  model  (excluding  bubble  transport).  2 
They  concluded  that  Thorpe’s  model  gave  the  best  agreement  with  the  measurements, 
but  by  being  forced  by  wind  rather  than  waves  it  led  to  some  discrepancies.  One  of  the 
primary  causes  of  the  enhanced  gas  transfer  due  to  bubbles  is  the  increase  in  diffusion 
of  gas  across  the  bubble  surface  due  to  the  increase  in  hydrostatic  pressure  with  depth. 
As  noted  in  the  introduction,  the  hydrostatic  pressure  at  10m  depth  is  twice  that  at  the 
surface.  Significant  bubble  populations  are  commonly  observed  at  10m  depth,  especially 
in  storms.  Impediments  to  improved  modelling  of  bubble  transport  include  uncertainties 
about  initial  distibutions  of  bubble  sizes  and  subsequent  breakup,  turbulent  transport 
and  surface  chemistry  and  contamination.  A  more  detailed  review  of  this  important  and 
fascinating  subject  is  beyond  the  scope  of  this  review,  but  the  reader  is  referred  to  Thorpe 
(1992)  for  a  current  assessment. 

It  is  believed  that  enhancement  of  gas  transfer  by  breaking  is  due  to  both  the  local  in¬ 
crease  in  turbulent  intensity  and  the  entrainment  and  dissolution  of  bubbles.  Kitaigorod- 
skii  (1984)  has  modelled  the  influence  of  patches  of  enhanced  turbulence  due  to  breaking. 
He  finds  that  the  transfer  velocity  k  is  proportional  to  5c-1|/2[j/el,(0)]1'/4,  where  e„(0)  is 
the  dissipation  of  turbulent  energy  near  the  surface.  3  While  noting  the  similarity  of  the 


2Note  that  theee  three  model  results  are  denoted  by  1,2,  3,  respectively,  in  Figure  6. 

3Note  that  Kitaigorodskii  used  the  Prandtl  number  to  denote  the  ratio  of  the  diffusion  of  momentum  to 
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scaling  of  the  velocity  to  Kolmogorov  scaling,  Kitaigorodskii  asserted  that  the  arguments 
leading  to  this  result  are  independent.  It  is  of  interest  to  determine  the  effects  of  the  en¬ 
hanced  dissipation  discussed  in  the  previous  section  in  the  light  of  Kitaigorodskii’s  result. 
From  our  estimates  of  Section  5  we  would  expect  an  enhancement  of  the  dissipation  by 
a  factor  of  at  least  60.  According  to  Kitaigorodskii  this  would  lead  to  an  increase  in  the 
transfer  velocity  by  at  least  a  factor  of  three. 

7.  The  Acoustics  of  Breaking  Waves 

Until  recently,  ambient  sound  in  the  audio  range  (20 Hz  —  20 kHz)  in  the  ocean  was 
broadly  classified  as  being  due  to  ocean  turbulence  (<  100 Hz),  shipping  traffic  (<  1  kHz), 
and  "wind”  noise  (100 Hz  —  20 kHz),  and  was  represented  by  smooth  empirical  spectra 
(Wenz,  1962).  While  it  was  recognized  that  much  of  the  wind  noise  was  in  fact  due 
to  breaking,  it  is  only  in  the  last  ten  years  that  attention  has  focussed  more  directly, 
and  in  a  quantitative  fashion,  on  the  relationship  between  ambient  noise  at  the  ocean 
surface  and  surface  wave  breaking.  The  beginnings  of  much  of  this  work  are  summarized 
in  the  proceedings  of  a  wonderful  conference  which  brought  together  acousticians  and 
hydrodynamicists  in  Lerici  in  1987  (Kerman,  1988).  By  that  time  field  measurements  had 
shown  that  discrete  acoustic  events  could  be  identified  with  breaking  (Farmer  &  Vagle, 
1988)  and  there  was  considerable  speculation  about  the  source  of  the  sound.  Sources  of 
sound  associated  with  the  entrainment  of  air  are  almost  certainly  the  dominant  source 
of  sea-surface  sound.  After  entrainment  the  volume  of  air  is  broken  up  into  smaller  and 
smaller  bubbles.  At  the  time  of  its  generation  a  bubble  is  not  in  its  equilibrium  shape  and 
as  it  relaxes  to  this  spherical  shape  it  oscillates  in  its  axisymmetric  (’’breathing”)  mode. 
This  volume  mode  of  oscillation  is  the  most  efficient  for  the  radiation  of  sound  and  has  a 
characteristic  frequency 

u)2  =  3~fP/pr2,  (21) 

where  7  is  the  ratio  of  the  specific  heats  of  the  gas,  P  is  the  equilibrium  pressure,  p  is 
the  density  of  the  liquid  and  r  is  the  equilibrium  bubble  radius.  For  a  bubble  of  radius 
1mm  in  water  at  atmospheric  pressure,  the  resonant  frequency  is  approximately  3.26 
kHz.  Banner  &:  Cato(1988)  presented  a  video  recording  of  the  sound  generated  in  this 
fashion  by  bubbles  pinching  off  in  a  quasisteady  spilling  breaker.  In  a  series  of  laboratory 
experiments,  Medwin  and  his  colleagues  (Medwin  &;  Beaky,  1989;  Medwin  &  Daniel, 
1990)  have  shown  that  this  is  the  dominant  source  of  sound  at  frequencies  greater  than 
approximately  400-500Hz. 

We  were  concerned  to  determine  whether  the  sound  generated  by  breaking  could  be  used 
to  do  more  than  just  identify  breaking  events.  Preliminary  experiments  were  undertaken 
which  showed  that  for  frequencies  greater  than  500Hz  the  sound  radiated  by  breaking 
waves  in  the  laboratory  correlated  with  the  energy  dissipated  (Melville  et  ah,  1988).  Now  if 
the  causes  of  this  correlation  could  be  understood,  and  if  the  result  also  applied  to  the  field, 
then  this  would  offer  an  efficient  way  of  monitoring  surface  wave  dissipation,  a  difficult 
measurement  to  make  directly  (See  above.)  These  preliminary  results  were  confirmed 
in  more  extensive  measurements  (Loewen  h  Melville,  1991a)  which  showed  that  of  the 


mass.  This  is  usually  reserved  for  heat  transfer. 
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Figure  7.  Moments  of  the  void  fraction  field  measured  under  unsteady  breaking  waves 
in  the  laboratory.  V/Vo :  normalized  total  volume  of  air;  Ab/V0\  normalized  area  of  the 
bubble  cloud,  where  b  is  the  breadth  of  the  channel;  a:  mean  void  fraction  by  percent; 
Ehj Ed'-  ratio  of  the  potential  energy  of  the  bubble  cloud  to  the  total  energy  dissipated  in 
breaking.  (Lamarre  &;  Melville,  1991a). 
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Figure  8.  Comparison  of  measured  frequency  of  collective  oscillations  under  unsteady 
breaking  waves  and  computed  frequency  based  on  void  fraction  measurements  (Loewen& 
Melville,  1992). 
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order  of  10~8  of  the  mechanical  energy  dissipated  was  radiated  as  sound.  Simple  semi- 
empirical  modelling  (Loewen  &  Meiville,  1991b)  showed  that  for  gently  spilling  waves  the 
sound  radiated  above  approximately  4Q0Hz  (Medwin  &  Beaky,  1989;  Medwin  &  Daniel, 
1990)  could  be  reproduced  with  a  simple  dipole  model  of  bubbles  entrained  at  the  surface. 
Inplicit  in  this  result  is  the  fact  that  the  sound  radiated  is  proportional  to  the  volume  of 
air  entrained. 

While  these  results  were  encouraging  for  gently  spilling  waves  they  did  not  account  for 
the  air  entrainment  and  acoustics  of  more  energetically  breaking  waves  in  which  large 
volumes  of  air  are  entrained.  In  these  cases  it  more  useful  to  consider  the  whitecap  as 
a  continuum  described  by  its  geometry  and  void  fraction  field,  rather  than  a  collection 
of  discrete  bubbles.  By  using  the  electrical  conductivity  of  the  air-water  mixture  in  the 
whitecap  we  were  able  to  measure  the  void  fraction  field  and  hence  the  evolution  of 
the  bubble  cloud  in  unsteady  breaking  waves  (Lammare  &:  Melville,  1991a)  .  It  was 
found  that  all  of  the  lower  order  moments  of  the  void-fraction  field  evolved  as  simple 
exponential  or  power  law  functions  in  time  (Fig.  7).  But  the  most  important  result  of 
this  work  was  to  show  that  the  energy  expended  in  entraining  the  air  against  buoyancy 
forces  could  account  for  up  to  50%  of  the  total  mechanical  wave  energy  dissipated.  This 
result  serves  to  support  the  earlier  empirical  correlation  between  energy  dissipation  and 
sound  radiation.  Subsequent  field  work  using  similar  conductivity  probes  showed  that  the 
large  void  fractions  measured  in  the  laboratory  (0(10%))  were  also  to  be  found  in  the 
field  (Lamarre  &  Melville,  1991b). 

Bubble  clouds  may  also  oscillate  as  a  whole  in  what  are  called  ’’collective  oscillations”. 
The  possibility  that  such  oscillations  could  occur  in  bubble  clouds  entrained  by  breaking 
waves  had  been  considered  by  Prosperetti  (1988)  and  by  Carey  and  Browning  (1988)  as 
a  source  of  low  frequency  sound  (<  500 Hz)  at  the  ocean  surface.  The  lowest  mode  of 
oscillation  of  a  spherical  bubble  cloud  has  a  frequency 
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where  R  is  the  radius  of  the  cloud,  7  is  the  ratio  of  the  specific  heats,  P0  is  the  equilibrium 
pressure  inside  the  individual  bubbles  comprising  the  cloud,  and  a  is  the  mean  void 
fraction.  If  isothermal  conditions  apply  7  =  1.  For  example,  if  Rq  =  0.3m,  a  =  0.1, 
and  P0  =  lOlfcPa  then  fi  =  30  x  2rrrads~1.  Confirmation  of  the  essential  characteristics 
of  collective  oscillations  in  laboratory  models  have  been  obtained  by  Yoon  et  al.  (1991) 
who  generated  a  vertical  bubble  column  above  a  bubble  generator,  and  by  Kolaini  et 
al.,(  1991)  who  dropped  volumes  of  water  onto  still  water  surfaces.  However,  Hollet 
(1989)  and  Farmer  &  Vagle  (1988)  had  earlier  shown  in  the  field  that  low  frequency  sound 
accompanies  wave  breaking.  Evidence  that  collective  oscillations  may  be  the  source  of  low 
frequency  sound  in  breaking  waves  was  provided  in  two  complementary  sets  of  laboratory 
measurements.  In  both  two-  and  three-dimensional  breaking  waves  Lamarre  &  Melville 
(1992)  measured  the  void  fraction.  Following  Lu  et  al.  (1990)  these  data  were  then 
used  to  predict  the  eigenfrequencies  of  the  collective  oscillations  of  the  bubble  clouds  and 
compared  with  low  frequency  sound  measurements  by  Loewen  &  Melville(1992).  It  was 
found  that  for  spilling  breaking  the  sound  radiated  below  approximately  500Hz  did  not 
vary  with  the  strength  of  the  break;  however,  on  the  transition  to  plunging  breaking  the 


low  frequency  sound  was  clearly  a  function  of  the  strength  of  the  break,  and  that  spectral 
peaks  could  be  identified.  The  results  of  this  comparison  between  measured  and  predicted 
collective  oscillation  are  shown  in  Figure  8. 

The  success  of  these  relatively  simple  acoustic  models  in  predicting  parameters  describ¬ 
ing  the  sound  radiated  by  breaking  waves  is  reason  for  confidence  that  ambient  sound 
may  be  a  useful  tool  for  learning  more  about  surface  fluxes.  For  example,  acoustic  char¬ 
acterization  of  collective  oscillations  may  lead  to  estimates  of  the  volume  of  air  entrained 
by  breaking  waves.  This  is  important  initial  data  for  models  of  gas  transfer. 
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Abstract 

This  review  summarises  some  recent  findings  about  ocean  surface 
waves  at  the  two  extremes  of  scale  -  the  expected  configurations  of 
extreme  wave  groups  (giant  waves)  on  the  one  hand  and  on  the  other, 
the  processes  whereby  short  wind -generated  gravity  waves  are  sup¬ 
pressed  by  their  interaction  with  longer  waves.  A  simple  approxi¬ 
mate  theory  expresses  the  space -time  configuration  of  extreme  waves 
in  terms  of  the  autocorrelation  function  of  the  wave  field  as  a 
whole,  and  this  is  tested  by  buoy  and  Surface  Contour  Radar  measure¬ 
ments  made  in  the  field  during  the  SWADE  experiment.  The  interac¬ 
tions  between  (not  so  extreme)  long  wave  groups  and  shorter 
wind-generated  gravity  waves  have  been  explored  in  the  laboratory  in 
the  NASA-Wallops  Wind  Wave  facility.  The  short  waves  are  suppressed 
by  breaking  during  the  passage  of  the  group,  their  significant  slope 
at  the  long  wave  crests  remaining  constant  as  the  amplitude  and 
wavelength  both  decrease.  The  gradual  regeneration  of  these  waves 
by  the  wind  is  interrupted  in  the  tank  by  a  wave  energy  front  origi¬ 
nating  from  the  upwind  end. 

Introduction 

People  who  look  at  waves  on  the  surface  of  the  sea  must  have 
noticed  both  their  randomness  and  their  quasi-regularity.  Their 
detailed  configuration  changes  constantly,  yet  the  dominant  wave 
groups  usually  have  a  comforting  regularity.  Occasionally,  an 
extremely  high  crest  or  deep  trough  'appears  from  nowhere'  (as  eye¬ 
witnesses  have  claimed)  sometimes  causing  damage  or  loss  of  life. 

In  a  moderate  wind,  the  surface  may  be  covered  by  distinct  wave 
groups  or  trains  a  meter  or  two  in  length,  but  under  high  winds  with 
much  longer  big  waves,  the  surface,  streaked  with  spume  and  rippled 
with  very  short  capillary  waves,  often  seems  to  have  largely  lost 
these  intermediate  scale  gravity  waves.  In  a  hard  driven  wind  sea 
when  the  longest  dominant  waves  are  breaking,  the  breaking  zones 
erase  shorter  slower  waves  leaving  a  smoother  surface  behind.  When 
the  longest  waves  themselves  are  not  breaking,  any  shorter  waves 
present  do  tend  to  break,  intermittently  and  sporadically,  near  the 
crests  of  the  largest  waves.  The  dynamics  of  the  breaking  process 
have  been  elucidated  greatly  by  the  fine  experiments  of  Duncan 
(1981),  Melville  and  Rapp  (1985)  and  others. 


There  is  clearly  a  melange  of  processes  occurring,  and  I  would 
like  to  concentrate  on  just  two  of  these.  The  work  that  I  review  is 
partly  theoretical;  the  measurements  have  been  made  both  in  the  lab¬ 
oratory  and  in  the  field,  and  a  number  of  people  have  contributed 
substantially.  Daifang  Gu  was  Involved  with  the  theory  and  did  the 
data  analysis  of  extreme  waves,  Mark  Donelan  and  Edward  Walsh  gener¬ 
ously  provided,  and  allowed  our  use  of,  hard-won  data  from  the  Sur¬ 
face  Wave  Dynamics  Experiment  (SWADE)  in  the  western  Atlantic  Ocean 
off  Virginia;  Jacob  Chu  and  Steven  R.  Long  conducted  the 
NASA-Wallops  experiments  and  the  extensive  data  analysis.  To  them 
all,  my  sincere  thanks. 


Extreme  wave  events,  giant  waves  or  rogue  waves,  have  been 
responsible  for  many  marine  accidents,  some  involving  loss  of  life. 
They  occur  under  storm  conditions  when  the  waves  are  already  high, 
perhaps  amplified  further  by  refraction  in  currents  such  as  the 
Agulhas  current  off  eastern  South  Africa  or  the  Gulf  Stream.  The 
very  highest  individual  waves,  the  rogue  waves,  seem  to  appear  with 
little  warning  and  may  be  regarded  as  the  statistical  extremes  in  an 
already  rough  random  sea,  occurring  sporadically  in  space  and  time. 
Although  the  occurrence  of  these  events  may  be  random,  it  is  of 
interest  to  inquire  whether  in  the  vicinity  of  extreme  wave  crests, 
there  is  any  predictable,  expected  configuration  of  the  sea  surface, 
any  'organized  structure'  of  the  surface  in  space  and  time  with 
which* a  mariner  and  his  ship  must  cope. 

Any  such  'organized  structure'  in  a  random  sea  (or  random  func¬ 
tion  in  general)  is  not  of  course  deterministic  in  the  sual  sense, 
but  the  selection  of  regions  surrounding  extreme  maxima  does  extract 
a  regularity  from  the  randomness  of  the  field  as  a  whole.  Suppose, 
for  example,  we  have  an  extensive  record  £(x.<)  of  surface  displace¬ 
ment  in  a  random  wind-generated  wave  field  as  a  function  of  position 
and  time.  From  this  record,  let  us  pick  out  the  high  wave  crests  - 
extract  instances  of  wave  maxima  lying  between  t,m  and  +  where 
is  a  large  multiple  of  the  r.m.s.  wave  height,  and  consider  the 
nature  of  the  surface  displacements  surrounding  these  maxima.  At 
the  maxima,  the  surface  displacements  are  all  in  essence  the  same, 
£m.  Close  to  the  maxima,  the  surface  displacements  The 

expected  or  mean  surface  displacement  is  somewhat  less  than  while 
still  being  very  large,  and  the  variance  about  this  mean  is  small. 

As  the  distance  in  space  or  time  from  the  maxima  increases,  the 
expected  value  of  decreases  and  the  variance  among  different  real¬ 
izations  increases  until  ultimately,  far  from  the  extreme  events, 
the  order  is  lost,  the  expected  position  of  the  free  surface  is 
simply  the  mean  water  level,  £-0  and  the  variance  is  that  of  the 
overall  wave  field.  This  paper  is  concerned  with  the  question: 

Given  the  existence  of  an  extreme  wave  crest,  what  is  the  expected 
surface  configuration  surrounding  this  '’rest  or,  equivalently,  the 
mean  over  many  realizations  of  extreme  events,  and  what  is  the  dis¬ 
tribution  of  variance  about  this  mean?  This  question  seems  to  have 
been  asked  first  by  Boccotti  (1981-1989)  of  the  University  di  Reggio 
Calabria  in  Italy.  In  a  random  sea,  one  cannot  predict  where  or 
when  extreme  crests  or  troughs  will  appear,  but  one  can  predict  what 
their  expected  configuration  will  be  when  they  do  occur. 

The  direct  calculation  of  the  expected  surface  displacement 
C(x.f),  given  that  at  the  origin,  say,  t,  has  an  instantaneous  maximum 
whose  value  is  a  substantial  multiple  of  the  r.m.s.  value 

°“(f>z)  >  is  extremely  cumbersome  and  has  not  been  done.  The  defi- 


nition  of  a  maximum  requires  that  the  surface  slope  Vt.be  zero,  the 
curvature  be  negative,  i;  be  zero  and  the  vertical  acceleration 
negative  and  conditional  probabilities  with  all  these  restrictions 
make  for  impossibly  tedious  algebra.  Let  us  ask  the  question  in  a 
different  way. 

Rather  than  seeking  the  precise  points  where  realizations  of 
K(.x,t)  attain  maxima,  let  us  consider  those  regions  where 

£  >  y(fc,2)12  -  y°.  where  y  is  (formally)  a  number  large  compared  with 
unity.  For  a  given  y,  at  any  instant  these  regions  consist  of  iso* 
lated  islands  each  containing  at  least  one  maximum  and  as  y 
increases,  the  islands  shrink,  converging  towards  the  maxima  and 
then  disappearing.  Our  interest  is  in  large  values  of  y,  where 
there  are  rare,  small,  isolated  islands  in  which  !;>yo  and  let  us 
pose  the  question  thus:  Given  that  at  x.l,  say,  i,  >  ya  where  y  is 

large,  what  is  the  expected  distribution  of  f;  in  the  vicinity  and 
what  is  the  standard  deviation  about  this  expected  value?  The 
expected  distribution  in  space  and  time  describes  the  configuration 
and  evolution  of  the  extreme  wave  events  and  the  standard  deviation 
is  the  random  uncertainty.  This  formulation  allows  a  very  much  sim¬ 
pler  calculation. 

Accordingly,  let  -  i;(x  +  r.  t  +  t),  t,2  =  o2  and 

p(r  •  t)  -  i,,^2/az,  the  autocorrelation  function  for  the  surface  dis¬ 
placement.  From  the  theorem  of  conditional  probability,  the  distri¬ 
bution  of  ^2  given  that  £i>yo  is,  in  the  usual  notation, 

P(^2  I  £ i  -  Y°)  ’  P(*>2‘  *>i  -  YflVP(^i  -  Y°)- 

and,  for  a  Gaussian  process,  the  righthand  side  is  easy  to  calculate 
(Phillips,  Gu  and  Donelan,  1992).  After  some  calculation,  it  is 
found  that  the  expected  value  of  given  that  is  >  yo  is  then 


^2  *  ftzPtt z^,>ya)dt,3 


-Yop[/(Y  )J  ■  (2-6) 

where  /(y)  is  an  integral  expression  whose  asymptotic  form  is 

/( y)  ~  ]  -  y~2  +  3y'4-  ...  for  y»l.  (2.4) 

In  particular,  when  r-O.t-O,  the  autocorrelation  function  p  -  1  so 
that  the  average  height  of  those  waves  higher  than  yo  is  yo[/(y)l  1  so 
that 


t,(x  +  r,l  +  - c) 

— - ~ - -  P(r.x). 

The  simplicity  of  this  result  is  a  surprise.  It  is  an  approximation 
in  two  senses  -  in  the  use  of  a  Gaussian  distribution  and  in  averag¬ 
ing  over  points  in  islands  surrounding  maxima  rather  than  averaging 
relative  to  the  maxima  themselves.  Let  us  see  how  well  this  simple 
approximation  works. 
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When  r  -0,  we  are  concerned  with  waves  measured  in  time  at  a 
single  point  by  a  wave  staff  or  buoy.  In  this  simple  case,  the 
direct  calculation  of  the  expected  history  of  surface  displacement 
surrounding  a  high  maximum  is  manageable  with  a  narrow  spectrum 
approximation  (Phillips,  Gu  and  Donelan,  1992).  Though  more  cumber* 
some  than  the  calculation  leading  to  the  result  above,  it  does  pro¬ 
vide  a  theoretical  check  on  the  latter.  Time  series  data  obtained 
in  SWADE  by  a  buoy  at  38.37<>N,  73.65°W  during  a  storm  on  26  October, 
1990  appeared  to  be  closely  Gaussian,  and  the  autocorrelation  func¬ 
tions  p(t)  were  calculated  for  each  of  four  data  segments.  The 
records  were  then  searched  for  maxima  larger  than  a  chosen  multiple 
of  a.  The  time  of  occurrence  of  a  high  wave  is  denoted  by  t„  and  the 
average  surface  displacement  in  the  vicinity,  + t)  was  found  as  a 
fraction  of  £({„,)  by  averaging  over  the  ensemble  of  maxima.  A  sample 
of  this  is  given  in  Figure  1,  the  solid  line  showing  the  expected 
configuration  using  the  direct  calculation  and  the  narrow  spectrum 


Figure  1.  Averaged  time  histories  of  *;({)  in  buoy  data  surrounding 
high  maxima  with  y-2  (top)  and  2.5  (bottom)  compared  with  the 
expected  configuration  (solid  line)  calculated' for  a  narrow 
spectrum.  For  y-  2. 5  the  sample  size  is  of  course  smaller. 


approximation,  while  the  dotted  and  dot-dash  lines  show  the  means  of 
the  displacements  following  and  preceding  the  maximum.  Figure  2 
shows  a  comparison  between  the  approximate  theory  (solid  line)  and 
the  measured  averaged  time  histories  surrounding  maxima.  The  direct 
calculation  and  the  approximate  theory  are  virtually  indistinguish¬ 
able  in  this  case  (which  is  typical  of  others  examined) ;  near  the 
maximum  the  measured  average  displacements  are  very  close  to  the 
theory  but  develop  more  random  variations  as  the  time  interval 


Increases  and  correlation  is  lost.  Far  from  the  maxima  of  course, 
the  expected  surface  displacement  is  zero  with  a  standard  deviation 
of  a,  that  of  the  wave  field  as  a  whole. 

A  much  more  interesting  case,  and  a  much  more  demanding  test  of 
the  result  above,  concerns  the  expected  spatial  configuration  of  the 
surface  surrounding  high  maxima.  No  direct  calculation  is  available 
for  this,  but  according  to  the  approximate  theory,  the  expected 
instantaneous  configuration  surrounding  high  crests  should  be  given 
by  the  spatial  autocorrelation  function,  with  t-0.  This  function 
is  of  course  the  Fourier  Transform  of  the  two-dimensional  wave  spec¬ 
trum  and  its  form  will  depend  on  the  directional  characteristics  of 
this  spectrum  as  well  as  its  scalar  wave -number  dependence. 

Extensive  data  on  the  spatial  configuration  of  the  sea  surface 
under  various  wave  conditions  have  been  obtained  by  Dr.  Edward 
Walsh's  group  during  SWADE  using  their  surface  contour  radar. 
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Figure  2.  As  in  figure  1,  but  now  compared  with  the  temporal  auto¬ 
correlation  function  p(t). 


Briefly  stated,  the  surface  contour  radar  is  an  aircraft-mounted 
radar  altimeter  whose  beam  sweeps  to  and  fro  transverse  to  the 
flight  path;  the  distance  from  aircraft  to  the  water  surface  being 
measured  by  the  return  time  of  the  individual  radar  pulses.  The 
transverse  sweeps  mark  out  a  swath  along  the  flight  track  whose 
width  is  proportional  to  the  aircraft  altitude  and  the  measurements 
provide  a  nearly  instantaneous  'map'  of  the  wave  pattern  in  the 
swath  below.  From  the  digitized  data,  the  spatial  autocorrelation 
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function  p(r)  can  readily  be  found;  as  before,  the  data  are  then 
searched  for  high  maxima  (or  low  minima)  and  the  ensemble  of  surface 
configurations  around  these  points  is  averaged  to  define  )/^m. 

This  is  to  be  compared  with  p(r)-. 

As  examples,  consider  two  contrasting  wave  conditions  encoun¬ 
tered  on  4  and  5  March  1991.  For  a  couple  of  days  prior  to  the  4th, 
the  wind  was  fairly  steady  from  the  south,  and  the  wave  field  at  the 
measurement  region  (south  and  east  of  the  Gulf  Stream)  was  uni- 
modal,  with  dominant  waves  moving  north,  having  a  narrow  directional 
spread.  The  significant  wave  height  was  about  5  m.  The  spatial 
autocorrection  function  of  waves  in  the  entire  swath,  shown  in  Fig- 


Figure  3.  (a)  Spatial  autocorrelation  function  p(r)  in  a  uni-modal 

wind  sea.  (b,  c,  d) ,  the  mean  surface  configurations  of  the  sea 
surface  surrounding  high  maxima  with  y  -  2,  2.5  and  3.5  respec¬ 
tively. 


ure  3  a,  is  consistent  with  a  field  of  rather  long-crested  waves, 
with  a  dominant  wavelength  of  about  250  m  travelling  slightly  to  the 
right  of  the  flight  path.  Maxima  for  £ >  ya  were  then  identified  for 
values  of  y-2,  2.5  and  3.5  and  the  ensemble  averaged  surface  con¬ 
figurations  in  the  vicinities  of  the  maxima  calculated.  The  results 
are  shown  in  Figures  (3b,  c,  d)  .  For  y-2  the  average  configuration 


is  remarkably  close  to  p(r);  as  y  increases,  the  theory  should  get 
better  but  the  statistics  deteriorate  (fewer  realizations)  so  that 
the  'noise'  increases.  The  dominant  characteristics,  however,  per¬ 
sist. 

During  the  following  night  there  was  a  cold  air  outbreak,  with 
strong  winds  from  the  North-West  and  on  5  March,  further  measure¬ 
ments  were  taken  a  little  to  the  south  of  the  previous  site.  The 
combination  of  wind  sea,  originally  travelling  to  the  south  east, 
but  refracted  and  scattered  by  passage  through  the  Gulf  Stream,  a 
residual  10-second  swell  travelling  north  produced  a  sea  state  with 
a  very  wide  directional  spread  in  its  dominant  waves.  The  autocor- 


Figure  4.  (a)  Spatial  autocorrelation  function  p(r)  in  a  wave  field 
with  a  wide  directional  spread  of  the  dominant  frequencies.  (b, 
c,  d)  mean  instantaneous  surface  configurations  surrounding  high 
maxima  with  y  ”  2. 2.5  and  3  respectively. 

relation  function  p(r)  had  the  rather  remarkable  form  shown  in  Figur 
(4a)  and  the  mean  instantaneous  configurations  surrounding  high 
maxima  are  illustrated  in  Figures  (4b,  c,  d)  for  increasing  values 
of  y  (remember  that  the  vertical  scale  is  exaggerated).  The  waves 
are  evidently  very  short  crested  as  is  consistent  with  their  wide 
directional  spread,  having  more  the  nature  of  a  transient  standing 


wave  formed  from  wave  groups  travelling  in  almost  opposite  direc¬ 
tions.  The  average  configuration  surrounding  minima  with  y«.2  is 
shown  in  figure  5  -  to  a  helmsman,  the  trough  would  look  like  a 
'black  hole'.  Eyewitness  reports  of  giant  waves  frequently 
described  them  as  "coming  out  of  nowhere".  It  is  interesting  to 
note  that  this  location  is  near  "the  north  wall  of  the  Gulf  Stream", 
a  region  infamous  for  erratic  and  extreme  waves. 


0 


Figure  5.  Mean  configuration  surrounding  extreme  troughs  in  the 
same  data  set  as  for  figure  4,  y-2.  The  bottom  of  the  trough 
(out  of  sight)  extends  to  -1. 


At  the  other  end  of  the  spectrum,  we  have  the  short  gravity  and 
capillary  waves.  Mitsuyasu  (1966)  discovered  that  in  the  presence 
of  swell,  short  gravity  wave  components  of  a  wind-wave  field  were 
considerably  smaller  than  they  were  without  the  swell,  all  other 
conditions  being  the  same.  The  effect  was  also  observed  by  Phillips 
and  Banner  (1974)  and  Donelan  (1987)  in  wind-generated  waves  super¬ 
imposed  on  continuous  trains  of  swell.  Although  the  phenomenon  is 
well  known,  there  1  consensus  ahcu*  the  mechanisms  involved. 
Phillips  and  Banner  ascribe  it  to  enhanced  breaking  of  the  wind 
waves  at  the  swell  crests,  while  Donelan  pointed  out  that  the  oscil¬ 
latory  straining  of  the  short  waves  by  the  swell  can  disrupt  the 
energy  transfer  to  them  from  longer  waves  by  resonant  wave 
interactions .  It  occurred  to  us  that  the  nature  of  the  mechanism 
might  be  clarified  if  the  wind  waves  were  subjected  to  the  passage 
of  a  group  of  longer  waves  rather  than  a  continuous  train  of  them. 
Enhanced  wave  breaking  provides  a  rapid  energy  loss  and  if  this  is 
the  process  responsible,  the  reduction  in  energy  density  should  also 
be  found  after  a  long  wave  group  had  passed.  On  the  other  hand, 
wave-wave  intereactions  have  time  scales  very  much  longer  than  the 
wave  period  so  the  disruption  of  them  as  the  group  passes  should 
have  little  net  effect. 
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Figure  6.  An  v!  diagram  of  the  wind-wave,  swell  group  interaction 
experiment. 
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Figure  7.  Ensemble  means  of  the  surface  displacement  specify  the 
propagation  of  wave  groups  past  the  measurement  stations.  Left 
column,  fetches  of  5.2,  5.9m;  right,  9.5,  14.4m. 


The  experiments  that  I  will  describe  now,  then,  involve  a  pre¬ 
existing  statistically  steady  wind  wave  field  in  a  laboratory  wind 
wave  facility  into  which,  at  the  upstream  end,  groups  of  longer, 
mechanically  generated  waves  are  injected,  the  time  histories  of  the 
wave  fields  being  measured  at  various  points  down  the  tank  before, 
during  and  after  the  passage  of  the  groups.  Because  of  the  random¬ 
ness  of  the  wind- generated  waves  and  the  absence  of  stationarity ,  it 
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Figure  8.  Time  histories  of  for  the  wind  waves  at  u.=  15cm/s 
(left  column)  and  27  cm/s  (right)  at  fetches  5.1,  (top  row), 
5.9m,  9,5m  and  14.4m  (bottom  row). 


was  necessary  to  repeat  each  experiment  a  large  number  of  times 
(100),  recording  the  data  digitally  in  order  to  obtain  ensemble 
averages.  Although  initially  motivated  by  the  questions  described 
above,  the  experiment  revealed  a  number  of  different  phenomena  and 
several  surprises,  and  to  put  them  into  context,  consider  the  x-t 
diagram  of  Figure  6  which  provides  a  'road-map',  of  the  experiment, 
with  fetch  down  the  tank  plotted  vertically  and  elapsed  time  hori¬ 
zontally.  Initially,  there  is  a  statistically  steady  field  of  wind 
waves  growing  with  fetch;  the  stationary  observation  points  are 
indicated  by  the  horizontal  lines.  A  group  of  longer  waves  is  then 
generated  at  zero  fetch  and  passes  down  the  tank,  moving  with  its 
group  velocity,  catching  up  and  overtaking  the  wind  waves  whose 

froup  velocity  is  less.  For  each  set  of  conditions,  the  experiment 
s  repeated  100  times  with  identical  input  pulses  of  the  wavemaker. 
The  ensemble  average  of  the  surface  displacement  is  just  the  (deter¬ 
ministic)  long  wave  group  (Figure  7);  when  this  is  subtracted  from 
each  realization,  the  residual  is  the  distribution  of  short 
wind- generated  waves  (as  a  function  of  time  at  each  measurement  sta¬ 
tion)  as  modified  by  interaction  with  the  long  wave  groups.  Care 
had  to  be  exercised  in  extracting  short  wave  information 
particularly  during  the  interaction,  and  details  are  given  by  Chu, 
Long  and  Phillips  (1992) . 


Figure  9.  Apparent  frequencies  of  wind  waves  at  crests  and  troughs 
of  the  swell,  with  slope  (ak)  and  frequency  n.  The  solid  line 
is  calculated  from  the  kinematic  conservation  equation. 


Figure  8  represents  the  time  histories  at  fixed  points  of  the 
mean  square  surface  displacement  of  the  short  waves  in  a  representa¬ 
tive  sample  of  the  experiments,  each  being  averaged  over  the  100 
realizations.  It  is  clear  that  in  each  case,  the  wind  waves  are 
suppressed  immediately  during  the  passage  of  the  long  wave  group, 
the  event  being  marked  by  large  spikes  on  the  left  of  the  figures 
resulting  from  'phase  jitter'  of  the  long  waves  among  the  different 
realizations.  This  suggests  that  in  these  experiments,  enhanced 
breaking  of  the  wind- generated  waves  is  indeed  the  process  responsi¬ 
ble  for  suppression,  since  it  provides  a  rapid  energy  loss  mecha¬ 
nism.  This  sugggestion  is  reinforced  by  measurements  of  the  short 
waves  during  the  passage  of  the  group.  A  short  wave  energy  packet 
is  stretched  and  compressed  by  the  orbital  velocities  of  the  longer 
waves,  the  magnitude  increasing  in  the  first  half  of  the  overtaking 
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long  wave  group  as  successive  wave  crests  are  higher  and  higher. 

Its  apparent  frequency  thus  oscillates  with  increasing  amplitude  and 
Figure  9  shows  the  (scaled)  measured  frequency  variation  as  a  func¬ 
tion  of  long  wave  slope.  The  line  shows  what  one  would  expect  from 
wave  kinematics;  the  observed  frequency  is  a  little  higher  than  the 
line  for  high  crests,  possibly  because  of  amplified  wind  drift  which 
is  neglected  in  the  calculation. 

The  mean  square  surface  displacement  of  the  wind  waves  at  the 
swell  crests,  relative  to  that  before  the  arrival  of  the  swell 
group,  is  shown  in  Figure  10.  In  the  absence  of  wave  breaking,  this 
would  increase  with  ak  because  of  the  convergence  of  flow  at  the 
crests  of  the  long  waves  combined  with  the  decrease  in  local  appar¬ 
ent  gravity.  Clearly,  in  the  experiments  t;2  decreases  as  successive 
wave  crests  become  higher.  Interestingly,  though,  the  'significant 

slope'  (t,2)  /A*.  where  X.d  is  the  dominant  wavelength,  remains  sensi¬ 

bly  constant  as  Figure  11  shows  -  the  wavelength  at  the  crest  of  the 
swell  varies  as  (1  -  ak)  -  indicating  that  as  they  are  deformed  by 
the  swell,  the  short  waves  retain  a  level  of  geometrical  similarity, 
as  one  might  expect  of  intermittently  breaking  waves. 
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Figure  10.  (left)  Mean  square  surface  displacement  of  the  wind 

waves  at  the  swell  crests  (referred  to  undisturbed  values)  as  a 
function  of  swell  slope  ak. 

Figure  11.  (right)  The  corresponding  significant  slopes. 


After  the  highest  swell  crest  has  swept  through  the  short  waves, 
breaking  appears  largely  to  cease  on  the  successively  smaller  crests 
that  follow.  When  the  group  has  gone  by,  the  water  surface  appears 
much  smoother  than  before  even  to  the  eye.  This  is  evident  also  in 
measurements  of  the  mean  square  slope  shown  in  Figure  12.  At  the 
higher  slopes  of  the  swell,  the  wind  wave  slope  is  reduced  enor¬ 
mously;  the  smaller  scales  in  the  wind  wave  field  which  contribute 
significantly  to  the  slope  seem  to  have  in  effect  disappeared. 

Following  the  suppression,  the  wind  waves  gradually  regenerate, 
though  at  a  rate  p  apparently  slower  than  that  indicated  by  Plant's 
formula  p  =  0.04(u./c)2.  The  best-fit  coefficient  from  these  experi- 
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Figure  12.  Mean  square  slope  at  a  fetch  of  5.9m  as  a  function  of 
time,  measured  using  two  probes  1  cm  apart.  left  panel: 
u.-lScms'1,  ak  —  0.1;  right:  u.*  IScms'1,  ak-0.26;  bottom: 
u.m27cms  l,  ak  -  0.26. 
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Figure  13.  Local  spectra  at  diLferent  phases  of  the  interaction. 
(1)  initially  undisturbed,  (2)  after  passage  of  the  long  wave 
group,  (3)  in  the  wave  energy  front  and  (4)  after  the  recovery 
phase . 
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ments  is  5.2 x  10‘3,  being  smaller  possibly  because  of  the  local 
smoothness  of  the  water  surface  under  these  conditions. 

This  recovery  is  interrupted,  particularly  at  small  fetches,  by 
the  sudden  arrival  of  a  front  of  waves  longer  and  higher  than  the 
original  undisturbed  wind  waves.  As  Figure  7  shows,  this  is  a  con¬ 
sistent  and  reproducible  occurrence.  Figure  13  shows  xocal  spectra 
at  various  points  in  the  interaction  -  one  is  the  initial  field,  two 
the  waves  after  suppression  (note  that  the  dominant  frequency  is  the 
same  but  the  spectral  density  is  reduced  over  the  whole  range) , 
three  is  the  spectrum  in  the  wave  energy  front  with  a  higher  energy 
density  and  lower  frequency  than  the  initial  field,  four  is  the 
spectrum  at  the  end  of  the  window,  when  the  waves  have  apparently 
returned  to  their  initial  state.  When  we  calculate  the  significant 
slope  of  the  waves  in  the  energy  front,  we  find  that  this  again  is 
the  same  as  in  the  undisturbed  field  -  they  are  higher  because  they 
are  longer.  We  do  not  altogether  understand  yet  tne  de  ;ailed  dyna¬ 
mics  of  this  wave  energy  front.  They  are  clearly  very  different 
from  the  sharp  front  in  tne  frequency  spectrum  of  wind  waves.  The 
wave  frequency  is  found  to  gradually  decrease  as  the  front  propa- 

fates  down  the  tank,  and  1  think  its  evolution  involves  leakage  of 
ow  frequency  components  ahead  of  the  front  by  dispersion,  which  are 
then  amplified  by  the  wind  until  they  reach  saturation. 

Now  that  1  have  come  to  things  that  I  do  not  understand,  I  think 
I  should  stop.  I  am  happy  to  acknowledge  the  support  of  the  U.S. 
Office  of  Naval  Research  for  these  projects  and  the  grants  N00014- 
90-J-1623  and  1482. 
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Abstract 

The  effect  of  vorticity  or  shear  on  the  inviscid  evolution  and  instability  of  surface 
waves  is  discussed.  The  instability  of  a  flat  surface  is  considered  in  detail.  For  the 
case  of  shear  in  the  water,  the  instability  is  examined  by  means  of  Howard’s  semi-circle 
theorem,  which  is  shown  to  elucidate  the  qualitative  nature  of  the  instability  domain. 
This  can  then  be  determined  numerically  without  recourse  to  analysis  of  the  critical 
layer.  The  dependence  of  growth  rate  on  the  air  profile  for  the  case  of  wind  blowing 
over  water  at  rest  is  similarly  considered.  For  bounded  laminar  profiles,  the  results  are 
found  to  be  sensitive  to  the  profile  shape,  and  Miles’s  approximate  formula  is  found 
to  be  inaccurate.  On  the  other  hand,  for  unbounded  logarithmic  profiles,  the  growth 
rates  are  relatively  insensitive  to  the  profile  shape.  However,  there  is  still  disagreement 
between  numerical  calculations  and  Miles’s  formula,  which  overestimates  the  growth 
rate  by  a  factor  of  two.  A  simple  model  is  presented  for  the  case  of  shear  in  both  water 
and  air. 


1.  INTRODUCTION 

There  are  two  problem  areas  in  which  wind  and  water  shear,  or  equivalently  vorticity 
in  the  air  and  the  water,  affect  the  instabilities  of  surface  waves  at  an  air-water  interface. 
First,  there  is  the  question  of  the  instability  of  existing  waves,  generated  mechanically  for 
example,  or  by  earlier  winds.  In  this  situation,  the  dynamics  of  the  air  is  unimportant 
and  the  hydrodynamics  of  the  water  motion  is  the  question.  There  are  then  three 
categories  of  wave  instability,  as  follows. 

(i)  Two-  and  three-dimensional  long  wave  envelope  modulations  or  side  band  insta¬ 
bilities: 

These  go  generally  under  the  name  of  the  Benjamin  and  Feir  [1]  instability.  Instabil¬ 
ities  of  this  kind  are  also  described  by  Lighthill  [2]  (who  used  Whitham’s  variational 
principle),  Zakharov  [3],  [4]  and  others.  These  instabilities  have  non- water  wave  inter¬ 
est,  because  they  exhibit  the  phenomenon  of  recurrence,  and  also  because  the  envelope 
modulation  while  weakly  nonlinear  is  described  by  the  cubic  Schrddinger  equation. 
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(ii)  Resonant  interactions,  Phillips  [5]: 

This  describes  the  transfer  of  energy  between  wave  trains  containing  waves  of  wavenum¬ 
bers  ki  and  frequencies  when  =  0  and  £  w,  =0. 

(iii)  Two-  and  three-dimensional,  sub-  and  super-harmonic  instabilities  of  finite  am¬ 
plitude  waves  of  permanent  form: 

The  two-dimension  case  was  studied  by  Longuet-Higgins  [6],  [7j.  Saffman  [8]  and 
MacKay  and  Saffman  [9]  applied  Hamiltonian  methods  and  the  concept  of  signature 
to  explain  qualitative  ly  some  of  the  results,  such  as  the  change  of  stability  when  the 
wave  energy  as  a  furction  of  wave  height  is  stationary  or  modes  collide.  The  three- 
dimensional  case  was  studied  by  McLean,  Ma,  Martin,  Saffman  and  Yuen  [10),  see  also 
McLean  [11],  We  shall  refer  to  this  as  the  MMMSY  instability. 

The  phenomena  in  (i)  and  (iii)  can,  of  course,  be  interpreted  in  terms  of  resonant 
interactions,  but  it  is  useful  to  categorize  them  separately  since  small  disturbances  grow 
at  exponential  rates  rather  than  the  linear  rate  associated  with  the  resonances  of  (ii). 

The  resonances  and  envelope  modulations  are  basically  slowly  varying,  weakly  nonlin¬ 
ear  phenomena,  and  the  modifications  due  to  shear  will  arise  primarily  through  changes 
in  the  dispersion  equations  and  the  speed  of  weakly  nonlinear,  uniform  wave  trains 
(Stokes  waves).  For  example,  Li,  Hui  and  Donelan  [12]  have  calculated  the  effect  of 
constant  water  shear  on  the  side-band  instability  for  the  case  of  infinite  depth,  and 
found  that  small  shear  tends  to  enhance  instability  whereas  large  shear  tends  to  sup¬ 
press  it. 

The  effect  of  shear  on  the  MMMSY,  basically  short  wave  instabilities,  is  a  more 
difficult  problem  and  will  require  massive  computations  in  general,  as  potential  theory 
cannot  be  used  to  relate  perturbations  of  the  flow  to  perturbations  of  the  surface. 
Also,  there  is  no  scalar  stream-function  for  three-dimensional  long  wave  disturbances. 
However,  Benney  and  Chow  [13]  have  used  perturbation  methods  for  weakly  nonlinear 
waves  and  found  that  weak  shear  generates  relatively  strong  three-dimensional  long 
wave  instabilities.  The  properties  of  the  instability  are  significantly  different  from  those 
of  the  Benjamin  and  Feir  type.  For  finite  ampliftude  waves  and  zero  surface  tension, 
Okamura  and  Oikawa  [14]  have  calculated  the  three-dimensional  instability  of  waves 
of  permanent  form  which  were  calculated  by  Teles  da  Silva  and  Peregrine  [15].  The 
disturbance  7/(1,  z,  t)  of  the  free  surface  has  the  form 

OO 

i  oc  e27ri(Px+9z)  J2  Vn  e2xinx.  (1) 

—  OO 

They  recover  the  finite  depth  MMMSY  results  (McLean  [16])  for  class  I  (which  includes 
the  Benjamin  and  Feir  instability  as  the  subharmonic  wave  number  p  tends  to  zero)  and 
class  II  (principally  period  doubling)  disturbances,  as  the  shear  tends  to  zero.  They 
also  see  the  Benney-Chow  long  oblique  waves,  which  are  absent  when  there  is  no  shear. 

The  second  problem  area  is  concerned  with  wave  generation  by  wind  or  current.  In 
particular,  we  can  consider  the  generation  of  waves  as  a  problem  of  the  hydrodynamic 
instability  of  a  uniform  state.  For  this  phenomena,  there  are  three  main  categories. 
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(i)  Kelvin- Helmholtz  (KH)  instability: 

This  is  the  classic  instability  of  a  vortex  sheet  modified  by  a  density  difference  between 
the  fluids  on  the  two  sides  of  the  sheet. 

(ii)  Miles  [17]  instability: 

This  is  the  instability  of  a  shear  flow  in  the  air  over  water  initially  at  rest  which  can 
input  energy  into  waves  whose  phase  speed  lies  between  the  minimum  and  maximum 
of  the  wind  velocity. 

(iii)  Drift  layer  instability: 

This  is  the  hydrodynamic  instability  of  a  current  with  shear  in  the  water,  which  could 
be  produced,  for  example,  by  the  drag  of  the  air  over  the  water  or  mechanically  by 
running  a  stream  over  the  surface,  as  for  example  in  an  estuary.  It  was  investigated 
by  Esch  [18]  for  gravity  waves  on  piecewise  linear  profiles,  by  Stern  and  Adam  [19]  for 
capillary-gravity  waves  on  piecewise  linear  profiles  and  by  Morland,  SafTman  and  Yuen 
[20]  for  capillary-gravity  waves  on  smooth  profiles. 

This  review  will  concentrate  on  these  three  mechanisms  of  wave  generation.  Note 
that  the  discussion  is  limited  to  inviscid  instability  mechanisms,  and  the  direct  effect 
of  viscosity  on  the  wave  evolution  is  neglected.  (Viscosity  may,  of  course,  play  an 
important  role  in  the  generation  of  the  shear  profiles,  but  this  is  assumed  to  be  on  a 
much  longer  time  scale  than  that  of  the  instability.)  Viscosity  is  sometimes  employed 
to  circumvent  analytical  difficulties  associated  with  a  critical  layer;  this  introduces  the 
Orr-Sommerfeld  equation  (e.g.  Valenzuela  [21]  and  subsequent  studies  of  coupled  air- 
water  shear  flows).  But  a  consistent  purely  inviscid  treatment  is  possible  and  the  critical 
layer  can  be  ignored  except  that  it  has  an  indirect  effect  in  requiring  fine  resolution  in 
the  numerical  integrations  when  the  growth  rate  is  small. 

The  further  mechanism  (Phillips  [22]),  in  which  waves  are  generated  by  random  pres¬ 
sure  fluctuations  due  to  a  turbulent  wind  and  the  main  effect  of  shear  is  the  modification 
of  the  dispersion  relation,  will  not  be  considered  here. 

2.  GENERATION  OF  WAVES  BY  WATER  SHEAR 

We  consider  the  instability  of  an  incompressible,  infinitely  deep,  steady  uniform  flow 
u(x,y,  t)  —  U(y ),  v(x,y,t )  —  0.  Here,  x  is  the  horizontal  axis  in  the  direction  of  the 
flow,  y  is  the  vertical  coordinate  (increasing  upwards),  and  u  and  v  are  the  velocity  com¬ 
ponents.  The  undisturbed  free  surface  is  taken  as  y  =  0.  The  air  motion  is  neglected.  It 
is  sufficient  to  consider  two-dimensional  motion,  as  the  inviscid  form  of  Squire’s  theorem 
holds.  That  is,  for  every  unstable  three-dimensional  disturbance  there  is  a  corresponding 
two-dimensional  disturbance  with  larger  growth  rate  (Yih  [23]).  Introducing  a  stream 
function,  we  consider  disturbances  of  the  form 

u'(x,yj)  =  i<f>y(y)/k  v'(x,y,t)  =  <f>(y)  (2) 

It  follows  from  the  Euler  equations,  and  the  boundary  conditions  that  the  pressure 
is  constant  on  the  free  surface  and  the  disturbance  vanishes  at  infinite  depth,  that  <P 
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satisfies 

-  * H  -§~=o- 

(3) 

with  boundary  conditions 

( U  -  c)24>y  -  ( Uy{U  -  c)  +  <7  +  Tk2)  <p  =  Q  on  y  =  0, 

(4) 

and 

cj)  — *  0  as  y  — >  —oo. 

(5) 

Here,  g  is  gravity,  T  is  surface  tension  and  k  is  the  wavenumber  of  the  disturbance. 
This  is  an  eigenvalue  problem  for  the  (possibly  complex)  disturbance  wave  speed  c.  If 
c,  =  Or  ^  0,  then  the  disturbance  grows  at  an  exponential  rate  and  the  flow  is  unstable. 
(The  roots  come  in  complex  conjugate  pairs  so  it  is  sufficient  to  find  a  non-zero  a-)  For 
a  given  shear  profile,  with  Ud  =  U (0)  giving  a  characteristic  speed,  and 

(6) 

a  characteristic  thickness  of  the  drift  layer,  it  follows  that 

c  =  fn  (Ud,  A,gyT,k).  (7) 

The  mathematical  task  is  to  determine  the  values  of  the  parameters  which  make  c 
complex. 

We  know  that  there  are  two  kinds  of  eigenfunction  and  eigenvalue.  There  are  the 
proper  ones,  which  are  associated  with  a  discrete  spectrum.  The  eigenvalues  are  isolated, 
and  the  eigenfunctions  are  non-singular.  In  the  pesent  case,  the  proper  or  discrete 
spectrum  contains  values  of  c  which  are  either  complex,  or  real  with  c  <  0  or  c  >  Ud-  The 
improper  eigenvalues  are  values  of  c  for  which  the  Rayleigh  equation  is  singular.  These 
have  c,  =  0  and  0  <  c  <  Ud.  The  spectrum  is  continuous  and  and  the  eigenfunctions  are 
singular.  The  continuous  spectrum  is  needed  for  the  initial  value  problem  or  analysis 
of  the  Rayleigh  equation,  regarded  as  the  small  viscosity  limit  of  the  Orr-Sommerfeld 
equation,  when  the  proper  spectrum  is  not  complete.  This  is  the  case  here,  where  there 
are  only  two  proper  eigenvalues.  But  the  continuous  spectrum  usually  contributes  a 
disturbance  which  decays  algebraically,  and  is  not  needed  for  considerations  of  stability. 
(An  exception  is  a  vortex  flow  studied  by  Smith  and  Rosenbluth  [24]  for  which  the 
continuous  spectrum  grows  algebraically.)  We  therefore  restrict  attention  to  the  discrete 
spectrum. 

Rayleigh’s  criterion  (see,  e.g.  Drazin  and  Reid  [25]  asserts  that  a  necessary  condition 
for  instability  is  the  existence  of  a  point  of  inflection  in  the  velocity  profile,  i.e.  there 
exists  yc  in  — oo  <  yc  <  0  such  that  Uyy{yc )  =  0.  However,  on  repeating  the  details  of 
the  argument,  it  is  found  that  Rayleigh’s  criterion  does  not  apply  to  this  problem  owing 
to  the  different  boundary  condition  at  the  free  surface. 


A  more  powerful  result,  for  qualitative  analysis  of  the  proper  eigenvalues  is  Howard's 
[26]  semi-circle  theorem,  which  was  shown  by  Yih  [23]  to  be  still  valid  for  the  present 
problem.  According  to  this  result,  the  unstable  proper  roots  of  the  eigenvalue  problem 
lie  in  a  semi-circle  in  the  cr  -  cz  plane  whose  diameter  lies  on  the  real  axis  from  0  to  Uj. 
We  can  now  describe  the  behaviour  of  the  eigenvalues  as  A  =  2n/k  varies  from  oc  to  0. 

When  Uj  =  0  and  there  is  no  shear  in  the  water,  the  dispersion  relation  is  c  —  ±<\, 
where 


Here  Am  =  2ir(T/g)l/"2  and  cm  —  (3//T) 1 //'1  are  the  wavelength  and  phase  speed  of  the 
slowest  capillary-gravity  wave. 

Now  introduce  shear  and  vary  A  keeping  all  other  parameters  constant.  When  A  is 
very  large,  the  effect  of  shear  is  negligible,  being  confined  to  a  relatively  thin  layer,  and 
the  speeds  arc  given  by  (8).  There  are  thus  two  modes,  one  co-flowing  (i.e.  with  velocity 
in  the  direction  of  the  shear  current),  and  the  other  counter-flowing.  As  the  wavelength 
decreases,  the  magnitudes  of  the  speeds  decrease. 

Consider  first  the  co-flowing  wave.  Clearly  its  speed  c  will  be  greater  than  Ud,  as 
shear  convects  the  wave.  Also,  as  A  — »  0,  the  speed  goes  to  infinity  as  the  effect,  of 
capillarity  becomes  dominant.  Therefore  there  must  be  a  value  of  A  such  that  the  speed 
is  a  minimum,  and  this  minimum  speed  is  greater  than  Ud.  Hence,  the  co- flowing  wave 
never  enters  the  Howard  semi-circle,  and  this  mode  is  stable.  As  the  Rayleigh  equation 
never  becomes  singular,  the  mode  is  unique  and  there  are  no  other  members  of  the 
discrete  spectrum  with  c  >  Ud. 

The  situation  for  the  counter- flowing  wave  is  fundamentally  different.  The  ent  rain¬ 
ment  effect  of  the  shear  now  increases  the  wave  speed,  measured  algebraically.  But  when 
A  is  small,  the  wave  speed  goes  back  to  -oc,  and  hence  there  is  a  value  of  A,  Ac  say,  for 
which  the  wave  speed  is  a  maximum,  denoted  by  V,  say.  There  are  now  two  possibilities. 
First,  the  entrainment  of  the  shear  is  smaller  than  crn.  Then  V  <  0  and  c  is  always 
real  and  negative,  and  the  mode  is  stable.  Second,  the  entrainment  is  sufficiently  large 
that  V  >  0.  Then  the  semi-circle  is  entered  and  complex  roots  appear.  However,  as 
A  continues  to  decrease,  the  capillary  effect  eventually  dominates  the  entrainment  and 
the  semi-circle  is  exited,  the  mode  changing  back  from  unstable  to  stable.  Tims,  there 
will  be  a  range  of  wavelengths  which  are  unstable,  the  limiting  values  being  neutrally 
stable  as  the  semi-circle  is  entered  and  left  at  c  =  0.  The  deciding  factors  are  the  values 
of  the  ratios  Ud/cm  and  A/A,„. 

We  now  give  arguments  to  show  that  U,i/cm  is  a  necessary  condition  for  instability, 
i.e.  the  waves  are  always  stable  when  this  condition  is  violated,  further,  when  this 
condition  is  not  satisfied,  unstable  modes  exist  only  when  A  exceeds  a  critical  value 
which  depends  on  Uo/crn. 

The  transition  wave  numbers  .or  which  the  counter-flowing  wave  speed  enters  or 
leaves  the  semi-circle  are  the  eigenvalues  k  of  the  equation 
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with  boundary  conditions 

Uj<t>y  -  ( UdUd,y  +  9  +  k2T )  <j>  =  0  on  y  =  0  (10) 

and  (5).  These  are  obtained  by  putting  c  =  0  in  the  Rayleigh  equation  (3)  and  its 
boundary  condition  (4).  By  inspection,  (9)  and  (10)  can  be  scaled  for  a  given  profile 
in  terms  of  c0jUd  and  A/ A.  Then  in  a  A/ A  -  c0jUd  plane,  the  curve  on  which  c  =  0 
separates  the  plane  into  an  upper  stable  region,  in  which  cv  <  0  and  c,  =  0,  and  a  lower 
unstable  region  in  which  0  <  cv  <  Ud  and  c,  /  0.  The  separating  curve  passes  through 
the  point  c0jUd  —  1  when  A/A  =0.  Now  c0  and  A  are  not  independent,  being  coupled 
by  the  dispersion  equation  (8),  which  is  fixed  when  cm/Ud  and  Am/A  are  known  and 
can  be  written 

Co  cm  l~ A  A  A  Ar  / 1 1 , 

TTd  “  Z/d  V  A  2A^  +  X2A'  (  } 

If  CmjUd  <  1  ,  the  dispersion  relation  intersects  the  stability  boundary  when  A  is 
sufficiently  large,  because  the  minimum  of  the  scaled  dispersion  relation  (11)  moves  to 
the  left  as  A  increases. 

For  the  case  of  an  exponential  profile,  U(y)  =  Ude.2y/i^,  the  c  —  0  neutral  curve  can 
be  determined  analytically,  and  is 


To  obtain  growth  rates,  numerical  methods  must  be  resorted  to  in  general.  However, 
this  is  fairly  straightforward  and  care  is  required  only  when  d  is  very  small.  For  instance, 
the  Rayleigh  equation  (3)  can  be  regarded  as  an  equation  for  complex  k  as  a  function 
of  c.  The  value  of  cr  is  fixed  and  ct  is  varied  until  the  solution  of  a  centred  finite 
difference  approximation  to  the  differential  equation  and  boundary  conditions  gives  k 
real.  Continuation  methods  can  be  employed  to  find  further  solutions.  The  critical  layer 
is  unimportant  in  the  calculation,  except  for  the  need  to  employ  a  fine  mesh  near  the 
value  of  y  for  which  U(y)  —  cr  when  d  is  particularly  small. 

When  T  —  0,  the  locus  in  the  semi-circle  is  qualitatively  different.  In  this  case,  the 
co-flowing  waves  are  again  always  stable,  but  their  speed  tends  to  Ud  as  A  — >  O.The 
counter-flowing  waves  are  stable  until  their  wavelength  decreases  to  a  value  Ac  at  which 
they  are  stationary.  For  A  <  Ac,  the  entrainment  by  the  shear  makes  the  speed  positive, 
and  the  waves  are  unstable  until  A  =  0,  when  cr  has  reached  the  value  Ud-  For  the 
exponential  profile, 


Ac  ir'Ujf  g*\~l 

A2  A5  V  4 U\) 


(13) 


The  mechanism  for  the  generation  of  waves  by  drift  layer  instability  is  qualitatively 
attractive.  It  provides  a  delay  mechanism  for  the  wave  generation  after  a  wind  starts 
blowing,  because  the  instability  mechanism  will  not  work  until  the  layer  thickness  A 
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reaches  a  critical  value.  But  quantitative  predictions  are  very  poor.  The  necessary 
condition  Uj  >  cm  is  not  satisfied  by  the  wind  drift  layer,  and  the  predicted  phase 
speeds  are  of  order  |  c,„  which  is  much  too  fast.  On  the  other  hand,  the  theory  may  be 
applicable  to  estuary  discharge  (J.  Jimenez,  private  communication)  or  tidal  flow  (D.W. 
Moore,  private  communication)  in  the  presence  of  wind.  An  onshore  wind,  for  example, 
would  tend  to  amplify  counter  flowing  waves  on  an  estuary  current  or  ebbing  tide  which 
would  be  unstable  and  produce  choppiness,  whereas  waves  would  not  be  supported  with 
an  onshore  wind  and  a  rising  tide  and  the  surface  would  remain  relatively  smooth. 

An  interesting  question,  currently  under  study,  is  the  behaviour  of  the  instability  in 
the  presence  of  a  stable  long  wave.  This  is  the  MMMSY  problem,  for  the  case  in  which 
a  superharmonic  disturbance  is  unstable.  For  capillary  gravity  waves,  it  is  possible  that 
there  might  be  enhanced  instability  because  water  shear  causes  extra  corrugation  of  the 
interfaces  (Milinazzo  and  SafFman  [27]). 

3.  EFFECT  OF  WIND  SHEAR  ON  WAVE  GENERATION 


We  now  consider  the  case  of  shear  in  the  air  and  not  in  the  water.  We  neglect 
surface  tension,  and  again  suppose  the  flow  is  laminar  and  inviscid.  In  this  case,  the 
eigenfunction  4>(y)  for  two-dimensional  waves  of  wavenumber  k  satisfies  the  Rayleigh 
equation  (3)  for  0  <  y  <  oo,  where  U(y)  now  denotes  the  shear  profile  in  the  air.  In  the 
water,  -oo  <  y  <  0,  the  Rayleigh  equation  reduces  to 

<t>yy-k2(t>  =  0.  (14) 

Further,  (f>  — »  0  as  y  —> >  ±oo,  and  at  the  interface  y  =  0, 

\HW  -  c)2  -  (U,  (V  -  c)  +  g)  =  0.  (15) 

These  equations  again  constitute  an  eigenvalue  problem  for  the  wave  speed  c. 

Miles  [17]  and  Lighthill  [28]  demonstrate  the  existence  of  growth  by  a  quasi-steady 
analysis  of  the  critical  layer  for  waves  whose  wavenumber  is  such  that  0  <  cw  <  U^. 
where  cw  =  y/g/k.  they  give  the  formulae 


rP  a 

c,=— .  /?  = 


w  2  d  (; 


where  r  —  pa/pw ,  £  =  ky,  w  —  U(y)  —  cw,  and  a  prime  denotes  differentiation  with 
respect  to  £.  A  subscript  c  indicates  evaluation  at  the  critical  layer  £c  =  kyc,  which  is 
defined  by  U(yc)  ~  cw.  If  cw  does  not,  lie  between  the  maximum  and  minimum  values 
of  U(y),  then  ft  is  defined  to  be  zero.  We  shall  refer  to  (16)  as  the  Miles  formula.  In 
the  case  of  exponential  profiles  in  the  air, 

U  =  Urx,(l  ~e~^/A),  (17) 

the  expression  for  0  can  be  evaluated  in  closed  form  to  yield 


kd  - 


16  irrk'^AU^. 
cw(2  +  kA)2{4  +  kA)i 
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It  can  be  shown  that  Squire’s  theorem  still  holds  (see  Morland  and  SafFman  [29]),  and 
it  is  therefore  sufficient  to  consider  two-dimensional  disturbances. 

As  for  the  case  of  shear  in  the  water,  Howard’s  semi-circle  theorem  still  holds  and 
combined  with  straightforward  numerical  methods  eliminates  the  need  for  a  critical 
layer  analysis.  For  a  bounded  flow  (i.e.  U0 0  <  oc)  such  as  the  exponential  profile 
(17),  or  an  error  function  profile  U  =  erf(2y/v/i:A),  there  are  two  modes  for  infinite 
wavelength.  As  A  decreases,  the  speed  of  the  co-flowing  wave  decreases  until  at  a  critical 
wavelength  Ac,  the  wave  enters  the  semi-circle  at  cT  ~  Uoo-  As  A  now  tends  to  zero,  the 
locus  is  inside  the  semi-circle  and  the  co-flowing  wave  is  unstable  for  0  <  A  <  Ac.  On 
the  other  hand,  the  speed  of  the  counter-flowing  wave  is  always  negative  and  approaches 
zero  as  A  — »  0.  These  waves  are  therefore  always  stable.  Outside  the  semi-circle,  the 
phase  speed  is  given  approximately  by  cv  =  ±cUJ(Ac)  «  !/«>  ;  which  is  to  be  expected  as 
the  air  is  just  a  small  pert'irbation  of  the  water  motion. 

The  details  of  the  complex  wave  speeds  in  the  semi-circle  depend  on  the  values  of  r 
(«  10~3  for  the  air-water  interface)  and  A g/U^  ,  and  on  the  profile.  If  A g/U^  >  r, 
then  cr  «  cw{ A)  inside  the  semi-circle.  For  Ag/U^  —  0.1,  fee*  ~  5  x  10— a5?/t/00 .  For 
A g/U^c  —  0.01,  kei  ~  0.2g/Uoo.  The  growth  rates  for  these  cases  given  by  the  Miles 
formula  (16)  are  different  by  a  factor  about  2  or  1/2  depending  on  the  profile. 

If  A  —  0,  the  shear  in  the  air  condenses  to  a  vortex  sheet,  and  the  instability  mech¬ 
anism  is  the  classic  KH  instability  (see  Lamb  [30]).  The  condition  for  the  instability  of 
the  smooth  profile  to  be  of  this  class  is  A g/U^  <C  2nr.  The  complex  wave  speed  for 
the  KH  instability  is 

c  ~  rU^  ±  y/Xg/2-n  -  rU*, .  (19) 

These  values  are  retrieved  by  the  numerical  computations  when  A g/U^  —*  0  with 
A  >  A.  In  this  limit,  the  approximation  cv  ~  cw  becomes  invalid  for  the  unstable 
modes  since  from  (19)  their  phase  velocity  is  approximately  constant  with  value  rU <X). 
The  KH  instability  becomes  significant  for  smooth  profiles  when  A  is  small  enough  that 
waves  long  compared  to  A  are  KH  unstable. 

Unbounded  profiles,  e.g.  lin-log  profiles  in  which  U  oc  y  for  y  <  y,  and  U  oc  logy 
for  y  >  y*,  where  y*  is  the  width  of  the  viscous  sub-layer,  are  of  interest  as  models 
of  turbulent  flow.  In  this  case,  the  semi-circle  occupies  the  whole  positive  cT  axis, 
and  all  wavelengths  are  unstable.  However,  the  numerics  is  still  straightforward.  It 
is  found  that  the  numerical  results  agree  qualitatively  with  those  calculated  from  the 
Miles  formula  (16)  provided  \^>  u\/g,  where  u,  is  the  friction  velocity.  This  condition 
ensures  that  the  critical  layer  lies  outside  the  viscous  sublayer  so  that  the  Miles  formula 
is  applicable.  For  further  details,  see  Morland  and  SafFman  [28].  However,  there  is 
a  significant  quantitative  difference,  with  the  Miles  formula  (16)  overestimating  the 
growth  rate  obtained  from  numerical  solution  by  a  factor  of  two. 

When  there  is  shear  in  both  vind  and  water,  the  problem  becomes  more  difficult.  A 
preliminary  analysis  using  a  piecewise  linear  profile  indicates  that  new  phenomena  may 
be  present.  The  advantage  of  such  a  profile  is  that  the  dispersion  relation  is  obtained 
from  solving  a  quart ic,  and  the  dependence  on  the  various  parameters  is  more  easily 
examined  than  is  the  case  for  smooth  profiles.  But  there  is  still  the  difficulty  of  making 
sense  of  the  results  when  there  are  many  parameters  in  the  problem.  The  profile  used 


14) 


is 


U(y)  =  l 


( 0 

Vw  +  Vwy/&w 

Vw  +  Vay/Aa 

\.vw  +  va 


if  y  <  Aw~, 
if  -Aw  <  y  <  0; 
if  0  <  y  <  Aa; 
if  y  >  Aa. 


(20) 


The  main  conclusion  from  an  initial  study  is  that  water  shear  tends  to  stabilize  the 
Miles  type  instability,  and  extends  the  boundary  of  the  region  where  KH  instability  is 
seen.  Also,  there  is  a  range  of  windspeeds  for  which  waves  with  a  wavelength  in  a  range 
do  not  grow,  i.e.  different  wave  components  behave  differently  under  a  given  wind  in 
the  presence  of  water  shear.  For  further  details,  see  Caponi  et  al  [31]. 
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Abstract 

Living  tissue  adapts  to  its  environmental  mechanical  loading  on  a  daily 
basis.  The  adaptation  begins  within  seconds  of  a  significant  loading  change 
and,  while  the  change  can  take  months  in  bone,  it  can  happen  in  weeks  in  soft 
tissues.  Wolff  s  law  is  one  of  the  observations  of  functional  adaptation  of  living 
tissue  to  mechanical  loading.  It  is  the  observation  that  the  axes  of  the 
individual  bony  struts  in  the  cancellate  (spongy  or  porous)  bone  coincide  with 
the  principal  axes  of  stress  induced  by  the  external  loads  applied  to  the  whole 
bone.  After  a  brief  historical  introduction  to  the  subject,  two  topics  of  current 
research  concerning  Wolffs  law  are  described.  The  first  topic  is  a 
phenomenological  continuum  model  whose  objective  is  to  describe  the  temporal 
evolution  of  both  the  density  changes  and  the  reorientation  of  the  anisotropic 
microarchitecture  when  the  stress  state  applied  to  a  bone  is  changed.  The 
second  topic  is  the  modeling  of  a  possible  cellular  mechanosensory 
transduction  mechanism  by  which  communicating  bone  cells  (osteocytes) 
sense  the  very  small  in  vivo  strains  in  the  bone  and  signal  other  bone  cells  to 
build  bone  (osteoblasts)  or  to  remove  bone  (osteoclasts). 

1.  HISTORICAL  BACKGROUND 

The  history  of  development  of  what  is  today  called  Wolff  s  law  begins  at  a 
meeting  of  a  Natural  Science  Society  in  Zurich  held  exactly  126  years  ago,  in 
August  of  1866.  In  attendance  were  Professor  H.  Meyer,  an  anatomist,  and 
Professor  K.  Culmann  who  was  Professor  of  Engineering  Science  at  the  newly 
created  Swiss  Federal  Technical  Institute  (ETH)  in  Zurich.  Professor  Culmann 
was  a  leading  structural  engineer  and  his  book  [1]  on  graphical  statics,  the 
then-current  standard  procedure  for  structural  stress  analysis,  appeared  that 
year.  He  is  credited  with  the  creation  of  what  we  now  call  the  two-dimensional 
Mohr's  circle  (i.e.,  Mohr  extended  the  Culmann  circle  to  three  dimensions).  At 
the  meeting  Meyer  made  a  presentation  in  which  he  displayed  preparations 
and  drawings  of  the  trabecular  architecture  seen  in  a  frontal  section  of  the 
pelvic  end  of  the  human  thigh  bone  as  shown  in  Figure  1.  Note  that  a  strut  of 
the  cancellate  (spongy  or  porous)  bone  is  called  a  trabecula,  from  the  Latin  for 
“little  beam.”  Culmann  observed  that  the  form  of  the  trabeculae  in  the  pelvic 
end  of  the  thigh  bone  closely  resembled  the  form  of  the  stress  flow  lines  (i.e., 
stress  trajectories)  in  an  object  of  the  same  shape  as  the  bone  and  loaded  in  the 
same  way.  He  made  that  observation  based  on  his  experience  with  a  crane 
design  that  was  similar  in  shape  to  the  frontal  section  of  a  human 
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Figure  1.  On  the  left  is  Culmann's  crane  with  the  principal  stress  trajectories  indicated.  On 
the  right  is  von  Meyer's  sketch  of  the  trabecular  architecture  in  a  section  through  the  proximal 
end  of  the  human  femur.  Both  the  femur  and  Culmann's  crane  are  loaded  transversely  at  their 
cantilevered  ends  as  illustrated  on  the  little  insert  at  the  lower  far  right.  Taken  from  Wolff  [2]. 

thigh.  Figure  1,  taken  from  an  1870  paper  [2]  of  Wolff  s,  illustrates  the 
relationship  between  trabecular  architecture  and  the  stress  flow  lines  of 
Culmann  s  crane.  The  coincidence  of  trabecular  directions  with  the  directions 
of  the  stress  flow  lines  is  called  Wolffs  law.  The  interaction  between  Meyer  and 
Culmann  is  considered  by  many  to  be  the  origin  of  what  is  perceived  to  be 
Wolff's  law.  The  name  of  Julius  Wolff,  an  anatomist  and  a  contemporary  of 
Meyer  and  Culmann,  is  associated  with  the  observation  because  Wolff  wrote 
voluminously  about  it,  Wolff  [3,  4].  The  meeting  of  Meyer,  the  anatomist,  and 
Culmann,  the  engineer,  was  an  important  historical  event,  the  apparent 
genesis  of  the  important  biological  concept  of  structural  adaptation.  Culmann’s 
involvement  with  the  ideas  of  structural  adaptation  in  bone  was  a  minor 
incident  in  a  highly  successful  professional  engineering  career.  Culmann  did 
not  publish  any  material  on  this  topic,  although  he  published  a  great  deal  on 
graphical  statics  and  a  very  influential  1851  article  on  the  latest  advances  in 
bridge,  railway  and  river-boat  construction  in  England  and  the  United  States, 
Charlton  [5]. 

With  the  observation  of  Culmann  as  his  basis,  Wolff  [2]  in  1870  stated 
that,  when  the  environmental  loads  on  a  bone  are  changed  by  trauma, 
pathology  or  change  in  life  pattern,  functional  remodeling  reorients  the 
trabeculae  so  that  they  align  with  the  new  principal  stress  trajectories.  Wolff 
never  attempted  to  prove  this  assertion.  In  1885,  Roux  [6],  who  broadly 
generalized  the  concept  of  functional  adaptation,  produced  an  analysis  of  the 
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Figure  2.  Left:  Pauwels'  [12]  sketch  of  the  cancellous  architecture  in  the  ankylosed  knee  he 
considered.  Right:  Pauwels'  sketch  of  the  principal  stress  trajectories  in  the  photoelastic  model 
he  constructed.  The  two  transverse  dotted  lines  in  the  photoelastic  model  are  the  boundaries  of 
the  anatomically  correct  model  of  the  knee. 

structure  of  the  cancellous  tissue  of  an  ankylosed  knee  as  a  proof  of  Wolffs  law. 
The  development  of  the  theory  of  the  trajectorial  architecture  of  cancellous  bone 
appears  to  have  slowed  after  the  1885  book  of  Roux  because  the  theory  had 
developed  as  far  as  it  could  with  the  then  current  techniques  for  stress 
analysis. 

In  1954  Pauwels  [7]  examined  another  ankylosed  knee  using  two- 
dimensional  photoelastic  models  to  estimate  the  principal  stress  trajectories  in 
the  knee.  Figure  2(a)  shows  Pauwels'  sketch  of  the  cancellous  architecture  of 
the  ankylosed  knee  and  Figure  2(b)  shows  the  stress  trajectories  in  his 
photoelastic  model  of  the  situation.  Pauwels  indicates  that  this  analysis  is  a 
proof  of  Wolffs  hypothesis. 

An  increased  research  interest  in  the  trajectorial  theory  of  bone 
architecture  has  occurred  in  recent  years.  This  increased  research  interest  is 
driven  by  two  synergistic  factors.  First,  there  is  the  clinical  need  to  understand 
the  mechanical  and  remodeling  behavior  of  bone  tissue  because  the  use  of 
implanted  bone  prostheses  has  increased  very  greatly.  Prostheses  transfer  joint 
forces  to  the  bone  cortex  via  the  trabecular  network.  Second,  scientific  and 
engineering  experimental  techniques  developed  in  the  physical  and  biological 
sciences  and  in  engineering  in  the  last  thirty  years  now  supply  the  technology 
that  blocked  further  development  in  the  1880's.  For  example,  Hayes  and  Snyder 
[8]  employed  quantitative  slereology  techniques  and  a  two-dimensional  finite 
element  mesh  to  study  the  trabecular  architecture  of  the  patella  and  concluded 
that  the  trabecular  structure  aligns  itself  with  principal  stress  trajectories. 
Little  et  al.  [9]  reported  a  three-dimensional  finite  element  analysis  of  the  upper 
tibia  and  made  drawings  of  the  stress  trajectory  patterns.  They  concluded  that 
these  patterns  are  very  similar  to  the  orientations  of  the  trabeculae  in  the 
section  views  of  actual  tibiae. 

The  underlying  idea  of  the  studies  of  Little  et  al.  [9]  and  Hayes  and 
Snyder  [8]  is  the  same  as  that  that  underlay  the  analysis  of  Roux  and  of 
Pauwels,  namely,  the  verification  of  Wolffs  trajectorial  architecture  hypothesis 
by  comparison  of  the  stress  trajectories  predicted  by  elastic  stress  analysis  with 
the  actual  trabecular  trajectories.  Roux  used  rubber  models  in  the  shape  of  the 
ankylosed  knee  and  noted  the  deformation  patterns  of  inscribed  lines  on  the 
surface  of  the  rubber  when  it  is  loaded  as  a  knee  is  loaded.  Pauwels  used  the 
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photoelastic  stress  analysis.  Unlike  the  experimental  models  of  Roux  and 
Pauwels,  the  finite  element  technique  can  be  applied  to  the  stress  analysis  of 
objects  of  any  shape  and  of  any  degree  of  anisotropy  and  inhomogeneity,  thus 
permitting  cortical  bone  to  be  distinguished  from  cancellous  bone,  and 
cancellous  bone  tissue  of  different  densities  and  different  elastic  constants  to  be 
distinguished  in  the  model. 

2.  THE  PHENOMENOLOGICAL  CONTINUUM  AND  MECHANOSENSORY 
TRANSDUCTION  MECHANISM  MODELS 

A  phenomenological  continuum  model  for  the  temporal  evolution  of  both 
the  density  changes  and  the  anisotropic  microstructure  of  cancellous  bone  is 
described  in  the  next  three  sections.  In  this  model  a  quantitative  stereological 
measure  of  the  microstructural  arrangement  of  trabeculae  and  pores  of 
cancellous  bone,  called  the  fabric  tensor,  is  related  to  the  elastic  constants  of  the 
tissue.  This  formulation  of  the  elastic  constitutive  equation  in  terms  of  the 
fabric  tensor  leads  to  a  mathematical  statement  of  Wolff's  law  at  remodeling 
equilibrium.  The  fabric  tensor  and  its  experimental  determination  are 
discussed  in  the  following  section.  The  relationship  between  the  elastic 
constants  and  the  fabric  tensor  in  a  local  region  of  cancellous  bone  is  described 
in  a  subsequent  section.  A  mathematical  formulation  of  Wolff s  law  of 
trabecular  architecture  at  remodeling  equilibrium  is  sketched  in  the  fifth 
section. 

The  remainder  of  this  paper  concerns  a  cellular  level  mechanosensory 
transduction  mechanism  for  Wolffs  law.  A  new  hypothesis  is  advanced  for  the 
cellular  level  feedback  mechanism  associated  with  Wolff s  law.  The  hypothesis 
concerns  the  mechanism  by  which  the  osteocytes  housed  in  the  lacunae  of 
mechanically  loaded  bone  sense  the  mechanical  load  applied  to  the  bone  by  the 
detection  of  cyclic  dynamic  strains.  The  model  incorporating  this  hypothesis 
employs  a  hierarchical  model  of  bone  tissue  structure  as  the  medium  for  the 
conversion  of  cyclic  mechanical  loading  applied  to  the  whole  bone  to  the  fluid 
shear  stress  at  the  surface  of  the  osteocytic  cell  process.  The  proposed 
mechanosensory  model  is  described  in  section  six  and  the  hierarchical 
anatomical  and  permeability  model  levels  are  detailed  in  the  next  two  sections. 
In  the  final  section  the  predictions  of  the  mechanosensory  model  and  their 
significance  are  described. 

3.  FABRIC 

It  is  recognized  that  porosity  or  solid  volume  fraction  is  the  primary 
measure  of  local  material  structure  in  a  porous  material.  Porosity  does  not, 
however,  reflect  any  directionality  of  the  specimen's  microstructure.  The  faDric 
tensor  is  constructed  to  represent  the  orientation  of  the  local  material 
microstructure  in  many  porous  materials,  [10-16],  The  term  fabric  tensor  is 
used  here  to  indicate  any  positive  definite  second-rank  tensor  in  three 
dimensions  that  characterizes  the  local  anisotropy  of  the  material's 
microstructure.  The  fabric  tensor  is  a  point  property  (even  though  its 
measurement  requires  a  finite  test  volume)  and  is  therefore  considered  to  be  a 
continuous  function  of  position  in  the  material  model.  Methods  exist  to 
measure  fabric  ellipsoids  in  cellular  materials,  cancellous  bone,  rocks  and 
granular  materials  and  are  described  in  [10-16]. 


One  rather  general  way  to  construct  a  fabric  tensor  for  a  material  is  from 
a  set  of  N  measurements  of  material  microstructural  features,  each 
measurement  characterized  by  a  scalar  mn  and  a  unit  vector  cn  (c"  c“=  1), 
where  n  =  1,  2,...N.  The  normalized  fabric  tensor  H  is  defined  in  terms  of  the  N 
observat  ons  as, 

N  N 

H  X  mn=  X  mncn®cn,  (1) 

n  =  1  n  =  i 

where  the  encircled  cruciform  indicates  the  open  or  tensor  product  of  cn  with 
itself.  Due  to  the  normalization,  trH  =  1,  since  cn-cn=  1  for  all  N. 

The  experimental  procedure  for  the  measurement  of  cancellous  bone 
fabric  is  described  by  Whitehouse  [15],  [16],  Harrigan  and  Mann  [12]  and 
Turner  et  al.  [13].  It  consists  of  determining  the  mean  distance,  mn,  between 
the  change  of  phases,  bone  to  void  and  vice  versa,  along  each  test  direction  cn. 
The  work  of  these  authors  has  shown  that  the  fabric  tensor  is  a  good  measure 
of  the  structural  anisotropy  in  cancellous  bone  tissue.  The  methodology  of 
making  measurements  is  easily  adapted  to  an  automated  computational 
system  as  shown  by  Harrigan  and  Mann  [12],  Turner  et  al.  [13]  and 
Hodgskinson  and  Currey  [14].  The  determination  of  H  for  the  cancellous  bone 
from  a  horse  is  illustrated  in  Figure  3,  which  is  taken  from  [14], 

4.  WOLFF'S  LAW  AT  REMODELING  EQUILIBRIUM 

A  continuum  formulation  of  Wolffs  law  at  remodeling  equilibrium  (RE) 
was  suggested  by  Cowin  [17].  This  formulation  is  based  on  an  elastic- 
constitutive  relation  for  cancellous  bone  in  which  the  elastic  coefficients  depend 
on  the  solid  volume  fraction  of  bone  as  well  as  the  fabric  tensor.  The  constitutive 
equation  for  this  continuum  formulation  is 

T  =  C(H,  v)[E]  ,  or  Ty  =  Qj]:m(H,  v)Ekm  ,  (2) 

where  T,  H,  E  are  the  stress,  fabric  and  strain  tensors  whose  Cartesian 
components  are  Tijf  Hjj  and  Ejj,  respectively,  C(H,  v)  is  the  fourth  rank 
elasticity  tensor  whose  components  are  Cjjkm.  and  v  is  the  solid  volume 
fraction  of  the  trabecular  tissue.  The  quantity  C[E]  is  the  second  rank  tensor 
with  components  CijkmEkm-  It  is  assumed  that  all  the  structural  anisotropy  of 
the  material  is  characterized  by  the  fabric  tensor  H;  this  assumption  can  be 
shown  to  require  that  the  material  have  crthotropic  symmetry,  [19].  The 
absence  of  bone  remodeling  is  remodeling  equilibrium.  Specifically, 
remodeling  equilibrium  (RE)  is  the  set  of  conditions  under  which  there  is  no 
realignment  of  trabecular  architecture  and  no  net  deposition  or  resorption  of 
cancellous  bone  tissue.  RE  is  thus  characterized  by  a  particular  equilibrium 
architecture,  denoted  by  v*  and  H*,  and  a  particular  stress  and  strain  state, 
denoted  by  T*  and  E*,  respectively.  The  RE  stress  T*  and  strain  E*  may 
actually  be  ranges  of  stress  and  strain,  or  some  special  measures  of  stress  and 
strain  history.  This  matter  is  discussed  by  Cowin  [ 20 j.  It  is  reasonable  to  think 
ofT*  and  E*  as  long  time  averages  of  the  environmental  stress  and  strain, 
although  they  may  be  other  functions  of  the  stress  and  strain  history  of  the  bone 


Figure  3.  The  fabric  ellipses  for  cancellous  bone;  faces  of  two  cubes  of  cancellous  horse  bone. 


tissue. 

Wolff  s  law  of  trabecular  architecture  is  the  observation  that  the 
principal  stress  axes  coincide  with  the  principal  trabecular  directions  in 
cancellous  bone  in  RE.  This  means  that,  in  RE,  the  principal  axes  of  stress  T* 
must  coincide  with  the  principal  axes  of  fabric  H*.  This  coincidence  of 
principal  axes  is  assured  if  the  matrix  multiplication  of  T*  and  H*  is 
commutative,  i.e.,  if 

T*H*  =  H*T*.  (3) 

This  equation,  which  is  a  simple  statement  of  the  commutability  of  the  matrix 
multiplication  T*  and  H*,  can  be  viewed  as  an  algebraic  statement  of  the 
phenomenological  Wolffs  law.  If  the  principal  axes  of  T*  and  H*  are 
coincident,  it  is  shown  in  [17]  that  the  principal  axes  of  E*  are  also  coincident 
with  those  of  T*  and  H*,  thus 

T*H*  =  H*T*,  E*H*  =  H*E*  T*E*  =  E*T*,  (4) 

the  principal  axes  of  T,  E  and  H  are  all  coincident  or  they  are  all  distinct.  Since 
the  eigenvectors  of  H  are  the  axes  of  material  symmetry  for  orthotropy,  this 
coincidence  of  the  principal  axes  of  T,  E  and  H  is  equivalent  to  the  observation 
that  the  principal  axes  of  T  and  E  coincide  in  an  orthotropic  material  only  when 
the  axes  of  T  and  E  are  coincident  with  the  axes  of  material  symmetry. 
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Figure  4.  An  illustration  of  the  formulation  of  the  evolutionary  fabric  problem  considered,  a.) 
The  remodeling  equilibrium  situation  associated  with  a  stress  state  T°  that  exists  for  t  <  0.  b.) 

The  situation  at  t  =  0+  when  a  new  stress  state  T*  is  applied  and  held  constant  for  all  subsequent 
time. 


5.  AN  EVOLUTIONARY  WOLFFS  LAW 

A  constitutive  hypothesis  to  describe  the  temporal  evolution  of  trabecular 
architecture,  an  extension  of  the  formulation  of  Wolff s  law  at  remodeling 
equilibrium  described  above,  was  put  forward  by  Cowin  et  al.  [21],  That 
constitutive  hypothesis  will  be  described  here,  without  equations.  This  is 
accomplished  by  describing  what  the  theory  will  predict  in  a  particular, 
spatially  homogeneous  evolutionary  fabric  problem. 

The  evolutionary  fabric  problem  considered  is  a  situation  in  which  a 
spatially  homogeneous  trabecular  architecture,  characterized  by  the  fabric  H° 
and  a  solid  volume  fraction  v0,  is  in  RE  under  the  stress  T°  and  the  strain  E°  for 
a  long  time  (-«  <  t  <  0).  At  time  t  =  0+  the  stress  is  changed  to  a  new  stress  T* 
and  held  at  the  new  stress  for  all  time  (0  <  t  <°°).  As  time  becomes  very  large  a 
new  RE  state  denoted  by  T*,  H*,  E*  and  v*  is  achieved.  The  two  RE  states  of  the 
problem  are  thus  characterized  as  follows:  the  one  that  exists  before  t  =  0  is 
characterized  by  T°,  E°,  H°  and  v0;  and  the  one  that  exists  as  time  becomes  very 
large,  is  characterized  by  T*,  E*,  H*  and  v*.  These  two  stress  states  are 
illustrated  in  Figure  4. 

The  fabric  and  strain  tensors  as  they  evolve  in  time  between  these  two  RE 
states  are  denoted  by  H(t)  and  E(t).  The  various  coordinate  systems  employed 
are  illustrated  in  Figure  5.  Figure  5  is  a  sequence  of  four  illustrations  showing 
the  temporal  evolution  of  the  trabecular  architecture  from  an  RE  situation 
characterized  by  states  of  stress,  strain  and  fabric  denoted  as  T°,  E°  and  H°, 
respectively,  to  an  RE  situation  characterized  by  states  of  stress,  strain  and 
fabric  denoted  as  T*,  E*  and  H*,  respectively.  Figure  5a  illustrates  the 
remodeling  equilibrium  situation  that  exists  for  t  <  0.  The  principal  axes  of  T°, 
E°  and  H°  are  coincident.  Figure  5b  illustrates  the  situation  when  remodeling 

is  initiated  at  t  =  0+.  The  applied  stress  is  now  represented  by  the  temporally 


Figure  5.  This  sequence  of  four  illustrations  shows  the  evolution  of  the  trabecular  architecture 
from  an  RE  situation  characterized  by  states  of  stress,  strain  and  fabric  denoted  as  T°,  E°  and 
H°,  respectively,  to  an  RE  situation  characterized  by  states  of  stress,  strain  and  fabric  denoted 
as  T*,  E*  and  H*,  respectively.  The  clockwise  sequence  starting  from  the  upper  left  hand 
corner,  (a),  (b),  (c)  and  (d)  is  described  in  the  text. 

constant  stress  tensor  T*,  whose  principal  axes  make  an  angle  x  with  the 
principal  axes  of  the  previously  applied  stress  tensor  T°.  Furthermore,  the 
temporally  varying  strain  tensor  E(t)  has  instantaneously  assumed  a  new 
value  while  the  fabric  tensor  H°  remains  at  its  t  <  0  value.  Figure  5c  illustrates 
the  situation  at  a  typical  time  t  as  remodeling  is  proceeding.  The  angles  that 
the  principal  axes  of  strain  and  fabric  make  with  the  horizontal  axes  are 
denoted  by  e(t)  and  K(t),  respectively.  As  time  increases,  the  principal  axes  of 
strain  and  fabric  move  into  coincidence  with  the  principal  axes  of  the  applied 
stress.  Figure  5d  illustrates  the  new  remodeling  equilibrium  situation  which  is 
achieved  as  t  tends  to  infinity.  A  new  RE  situation  has  been  established;  it  is 
characterized  by  states  of  stress,  strain  and  fabric  with  coincident  principal 
axes  denoted  as  T*,  E*  and  H*,  respectively,  and  by  a  solid  volume  fraction  v*. 

The  model  just  described  is  necessarily  non-linear.  This  non-linearity  is 
fundamental  in  that  it  stems  from  the  fact  that,  during  remodeling,  the 
relationship  between  stress  and  strain  is  changing  as  the  stress  and  strain 
variables  themselves  are  changing.  In  order  to  preserve  the  remodeling 
property  of  the  model,  terms  that  are  of  the  order  strain  times  the  changes  in 
density  and/or  microstructural  properties  must  be  retained.  If  these  terms 
were  dropped,  there  would  be  no  feedback  mechanism  for  architectural 
adaptation  and  no  adaptation  of  the  trabecular  architecture.  There  is, 
therefore,  no  linearized  version  of  this  model  of  the  temporal  evolution  of 


trabecular  architecture.  A  significant  limitation  of  the  proposed  continuum 
model  is  the  length  scale  below  which  it  cannot  be  applied,  about  5 mm,  because 
of  the  inhomogeneity  of  the  porous  structure.  This  precludes  application  of  the 
model  at  a  bone-implant  interface. 

6.  MECHANOSENSORY  MECHANISM 

The  continuum  model  considered  above  is  described  as 
“phenomenological”  because  it  has  as  its  objective  a  description  of  the  observed 
phenomena,  without  addressing  the  mechanism  that  controls  the  phenomena. 
The  remainder  of  this  paper  addresses  this  shortcoming  by  describing  a  model 
for  a  cellular  level  mechanosensory  transduction  mechanism  for  Wolffs  law. 
This  is  a  new  hypothesis  for  the  cellular  level  feedback  mechanism,  recently 
advanced  by  Weinbaum  et  al.  [22].  The  hypothesis  is  that  the  mechanical  loads 
applied  to  a  whole  bone  cause  the  movement  of  bone  water  which,  in  turn, 
causes  shear  stresses  on  the  bone  cells  encased  in  the  whole  bone;  the  bone 
cells  sense  these  stresses  and  communicate  with  other  bone  cells  to  deposit  or 
resorb  bone  tissue.  The  bone  cells  that  sense  the  fluid  movement  are  the 
osteocytes  housed  in  the  lacunae.  The  strains  that  are  detected  are  cyclic 
dynamic  strains.  The  model  incorporating  this  hypothesis  employs  a 
hierarchical  model  of  bone  tissue  structure  as  the  medium  for  the  conversion  of 
cyclic  mechanical  loading  applied  to  the  whole  bone  to  the  fluid  shear  stress  at 
the  surface  of  the  osteocytic  cell  process. 

There  are  three  levels  of  scale  in  this  model.  The  smallest  scale  level  is 

the  spacing  A  (approximately  7nm)  between  the  proteoglycan  fibers  on  the 
membrane  surface  of  the  osteocytic  process.  The  middle  scale  level  is  the  gap 
(approximately  0.1  pm)  between  the  wall  of  a  canaliculus  and  the  surface  of  the 
cell  process  in  the  canaliculus.  The  largest  scale  level  is  a  unit  cell 
(approximately  a  cubic  voiume  40  pm  on  each  edge)  containing  a  lacuna  and 
half  the  length  of  all  the  canaliculi  associated  with  the  lacuna.  Using  viscous 
fluid  theory,  the  Brinkman  approximation  for  fluid  flow  through  porous  media 
and  the  Biot  theory  -  the  theory  of  stress-induced  flow  of  interstitial  fluid  in 
porous  solids  -  we  relate  the  fluid  shear  stress  at  the  osteocytic  surface  to  the 
mechanical  load  applied  to  the  bone. 

7.  HIERARCHICAL  MICROANATOMICAL  MODEL  LEVELS 

The  purpose  of  this  section  is  to  describe  the  three  hierarchical 
anatomical  levels  of  the  model.  An  anatomical  illustration  of  a  lacuna  housed 
osteocyte  and  its  cytoplasmic  processes,  taken  from  page  225  of  Krstic  [23],  is 
shown  in  Figure  6.  A  schematic  model  of  the  lacunar-canalicular  porosity 
associated  with  the  fluid  space  surrounding  the  osteocytes  in  a  trabecula  is 
illustrated  in  Figure  7.  An  idealized  model  of  the  cross  section  of  a  canaliculus 
is  shown  in  Figure  8.  This  figure  illustrates  the  details  of  the  fluid  annulus  of 
the  canaliculus  with  its  cell  process  and  fiber  matrix  structure.  While  Figure  7 
is  on  the  length  scale  L  (approximately  40pm)  of  the  lacunar  spacing,  the  field 
shown  in  Figure  7  is  5L  by  5L  (200pm  by  200pm);  both  panels  of  Figure  8  are  on 
a  scale  of  about  1/500  that  of  Figure  7,  approximately  0.4pm  by  0.4pm.  The  top  of 
Figure  8  shows  the  annular  shape  of  the  transverse  cross  section  and  the 
bottom  shows  a  longitudinal  cross  section.  The  annular  region  exists  between 
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Figure  6.  Diagram  of  two  osteocytes  (1)  in  the 
lamellar  bone  of  calcified  bone  matrix  (3). 
Two  neighboring  lamellae  (2)  with  different 
collagen  orientations  (7)  are  visible.  The 
osteocytic  cell  bodies  are  located  in  lacunae 
and  are  surrounded  by  a  thin  layer  of 
uncalcified  matrix  (4).  Their  cell  processes 
(5)  are  housed  in  canaliculi  (6).  Some  of  the 
gap  junctions  between  the  cell  processes  are 
indicated  (arrows). 


the  surface  of  the  osteocytic  cell  membrane  and  the  surface  of  the  canalicular 
wall.  Figure  8  shows  an  idealization  of  the  cross-bridging  fibers  and  the  open 
gap  A  between  the  fibers.  The  cross-bridging  fibers,  which  traverse  the  annular 
region,  represent  the  cross  linking  proteoglycans  of  the  surface  glycocalyx 
structure.  It  is  important  to  note  that  the  canaliculus  shown  in  Figure  8  is  one 
of  the  many  canaliculi  indicated  on  the  much  larger  scale  idealization  shown 
in  Figure  7.  Figure  9  gives  an  enlarged  view  of  the  cross  linking  proteoglycan 
net  in  a  small  central  section  of  a  typical  canaliculus,  such  as  the  one  labeled 
[1]  in  Figure  7.  Figure  9  is  another  idealized  model  of  the  cross  linking 
proteoglycan  net  in  the  annular  region  between  the  osteocytic  cell  membrane 
and  the  surface  of  the  canalicular  wall.  Figure  9  is  on  a  scale  of  about  1/4  the 
scale  of  Figure  8  and  1/2000  the  scale  of  Figure  7.  Although  the  model  of  the 
cross  linking  proteoglycan  net  used  in  the  analysis  of  [22],  shown  in  Figure  8, 
does  not  mimic  the  detail  of  the  representation  of  the  cross  linking  proteoglycan 
net  shown  in  Figure  9,  the  work  of  Weinbaum  et  al.  [25]  shows  that  the  effective 
fluid  resistance  is  roughly  the  same  for  both  periodic  and  random  matrix 
structures  with  the  same  average  fiber  spacing. 

The  four  fundamental  lengths  that  characterize  the  local  fluid  behavior 
are  shown  in  these  figures.  One  is  the  center  to  center  distance  between  two 
lacunae  L  illustrated  in  Figure  7  and  a  second  is  the  radius  of  curvature  of  the 
annulus  b  as  shown  in  Figure  8.  The  other  two  fundamental  lengths  are  shown 
in  Figures  7  and  8:  the  gap  height  of  the  annulus,  (b  -  a),  and  the  matrix  fiber 
spacing  A. 

8.  HIERARCHICAL  PERMEABILITY  MODEL  LEVELS 

The  purpose  of  this  section  is  to  describe  the  three  permeability  constants 
associated  with  the  three  hierarchical  length  scales  A  ,  (b  -  a)  and  L.  These 


Figure  7.  An  idealized  model  of  a  trabecular  cross  section  illustrating  the  lacunar  canalicular 
porosity  associated  with  the  fluid  space  surrounding  the  osteocytes  in  a  trabecula.  The  number 
[1]  indicates  one  of  the  numerous  canaliculi  that  connect  lacunae  [21.  The  number  [11  also 
indicates  the  region  of  this  Figure  that  is  enlarged  many  times  in  Figure  8.  Each  lacuna  1 2] 

houses  an  osteocyte 
(not  shown),  and 
osteocytic  processes 
(not  shown)  run  from 
each  osteocyte  about 
half  way  down  each 
canaliculus  to 
connect,  with  a  gap 
junction  (not  shown), 
to  the  osteocytic 
processes  from  the 
osteocyte  housed  in 
the  adjacent  lacuna. 
The  number  r3 1 
indicates  the  unit 
cell  used  in  our 
calculation  of  the 
large  scale 
permeability 
constant  k,  [4] 
indicates  bone  cells 
on  the  surface  of  the 
trabecula.  The 
linear  stress 
distribution  that  the 
cross  section  is 
subjected  to  at  one 
extreme  of  the  cyclic 
loading  is  indicated 
by  [5]  and  at  the  other 
extreme  by  [61.  This 
linear  stress 
distribution 
represents  a  periodic, 
combined  axial  and 
bending  loading. 

The  length  L  is  the 
center  to  center 
distance  between  two 
lacunae. 

levels  correspond  to  the  three  hierarchical  levels  of  the  model  described  in  the 
previous  section.  The  three  permeability  constants  are  denoted  by  kp,  kp  efr  and 
k,  respectively,  and  each  has  the  units  of  area.  The  local  or  small  scale 
permeability  constant  kp  represents  the  permeability  of  the  bone  fluid  through 
the  cell  membrane  glycocalyx  of  cross-bridging  fibers  of  the 
glycosarainoglycans  (GAGS)  of  the  proteoglycans,  without  consideration  of  the 
boundaries  of  flow,  namely  the  surface  of  the  cell  process  and  the  wall  of  the 
canaliculus.  In  order  to  account  for  the  boundary  layer  effects  at  these 
boundaries  of  the  por'  space,  the  effective  permeability  kp>efr  constant  is 
introduced.  It  represents  an  averaging  of  the  small  scale  permeability  kp  over 
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(a) 


Figure  8.  An  idealized  model  of  the  cross 
section  of  a  canaliculus:  (top)  transverse 
cross  section  showing  the  annular  shape  of  the 
region,  (bottom)  the  longitudinal  cross 
section.  This  Figure  is  on  a  scale  of  about 
1/500  times  the  scale  of  Figure  7.  This  Figure 
gives  an  enlarged  view  of  a  small  central 
section  of  a  typical  canaliculus,  such  as  the 
one  labeled  [lj  in  Figure  7.  The  number  [1]  of 
this  Figure  indicates  the  surface  of  the 
osteocytic  cell  membrane,  [2]  the  gap  junction 
between  two  connecting  osteocytic  cell 
processes,  [3]  the  surface  of  the  canalicular 
wall,  [4]  a  typical  fiber  traversing  the  annular 
region  and  representing  the  cross  linking 
proteoglycans.  The  dotted  lines  [5]  represent 
the  outer  edge  of  the  viscous  boundaiy 
layer.The  velocity  distribution  varies  from  a 
radially  distorted  parabola  (6],  to  a  plug  flow 
[7]  as  the  effective  viscous  boundary  layer  diminishes  due  to  an  increase  in  the  fiber  density. 
Three  of  the  four  fundamental  lengths  that  characterize  the  local  fluid  behavior  are  shown:  the 

radius  of 
curvature  of 
the  annulus 
b,  the  gap 
height  of  the 
annulus, 

(b  -  a),  and 
the  open  gap  A 
between  the 
fibers.  The 
fourth 

fundamental 
length  is  the 
center  to 
center 
distance 
length  L 
between  two 
lacunae 
shown  in 
Figure  7. 


the  annular  pore  space  of  the  canaliculus.  The  large  scale  permeability 
constant  k  represents  the  permeability  of  the  entire  canalicular  system  of  a 
lacuna,  a  system  consisting  of  all  the  canaliculi  associated  with  one  lacuna. 
The  following  items  are  discussed  below  in  the  order  indicated:  the  largest 
scale  constant  k;  then  the  smallest  scale  constant  kp;  formulas  for  kp  eff  and  k; 
and  the  relationship  of  kp  to  D. 

The  large  scale  permeability  constant  k  is  introduced  as  part  of  Darcy's 
law.  If  u  represents  the  fluid  velocity  vector  and  Vp  the  gradient  of  the  pore 
fluid  pressure  p,  then  Darcy's  law  is 


u  =  -KVp, 


(5) 


15? 


Figure  9.  An  idealized  model  of  the  cross  linking  proteoglycan  net  in  the  annular  region 
between  the  osteocytic  cell  membrane  and  the  surface  of  the  canalicular  wall.  This  Figure  is  on 
a  scale  of  about  1/4  the  scale  of  Figure  8  and  1/2000  the  scale  of  Figure  7.  This  Figure  gives  an 
enlarged  view  of  the  cross  linking  proteoglycan  net  in  a  small  central  section  of  a  typical 
canaliculus,  such  as  the  one  labeled  [1J  in  Figure  7.  The  number  [1]  of  this  Figure  indicates  the 
osteocytic  cell  membrane,  [2]  the  surface  of  the  canalicular  wall,  and  [3]  the  darkened  ellipses 
represent  albumin  molecules. 

where  the  coefficient  of  permeability,  k  =  k/p,  is  a  ratio  of  the  permeability 
constant  to  the  fluid  viscosity  p. 

The  local  or  small  scale  permeability  constant  kp  for  the  fluid  filled 
matrix  is  introduced  as  part  of  Brinkman's  [26]  equation.  The  theory  in  Tsay 
and  Weinbaum  [27]  has  shown  that  Brinkman's  equation  provides  a  highly 
accurate  approximation  for  bounded  flow  in  a  channel  with  cross-bridging 
fibers  provided  the  aspect  ratio  of  the  fibers  is  greater  than  five.  This  criterion 
for  the  aspect  ratio  is  easily  satisfied  for  long  slender  proteoglycan  fibers.  We 
thus  describe  the  fluid  annulus  surrounding  the  osteocytic  process  by  a 
Brinkman  equation  for  a  fiber  filled  medium, 

Vp  =  -  — -  u  +  pV2u  .  (6) 

kp 

Brinkman's  equation  (6)  reduces  to  Darcy's  law  (5)  in  the  dense  fiber  limit 

2 

when  kp  is  small  and  to  the  standard  Stokes  equation  (i.e.,Vp  =  pV  u)  in  the 
limit  where  the  fibers  are  dilute  and  kp  is  large.  The  Brinkman  equation  (2) 
shows  that  the  thickness  of  the  fiber  induced  boundary  layer  is  of  the  order  vk^. 

The  average  velocity  of  flow  through  a  canalicular  pore  annulus 
containing  the  fiber  matrix  is  obtained  by  dividing  the  volume  flow  rate  Q 
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through  the  canaliculus  by  the  annular  area  x(b2  -  a2).  The  effective 
permeability  kp>eff  is  determined  from  the  average  velocity  Q/7c(b2  -  a2),  where  Q 
is  calculated  from  the  velocity  profiles  in  the  annulus  obtained  from  the 
Brinkman  equation  (6),  the  fluid  viscosity  p,  and  the  pressure  gradient  in  the 
axial  direction  of  a  canaliculus,  |Vpj,  thus 


kp.eff  — 


Q 

?c(b2  -  a2) 


(7) 


The  large  scale  permeability  constant  k  can  then  be  determined  by  first 
calculating  the  large  scale  volume  flow  rate  by  multiplying  the  volume  flow 
rate  Q  through  one  canaliculus  by  n/L2-  the  number  of  canaliculi  n  per  unit 
area  L2  crossing  the  face  of  a  unit  cell.  The  quantity  nQ/L2  is  equal  to  the 
average  velocity  u  in  the  axial  direction  of  a  canaliculus.  Darcy’s  law  (1)  can 
then  be  inverted  to  obtain  an  expression  for  the  large  scale  permeability 
constant  k, 

.  (8) 


This  is  the  large  scale  permeability  constant  k  for  the  unit  cell  illustrated  in 
Figure  7.  The  cell  is  assumed  to  have  n  canaliculi  per  unit  area  in  each  of  its 
three  principal  directions. 

The  value  of  the  small  scale  permeability  constant  kp  for  an  infinite 
medium  can  be  related  to  the  fiber  spacing  parameter  A.  A  relationship  of  this 
type  can  be  obtained  from  the  rigorous  solution  for  the  flow  perpendicular  to  a 
two-dimensional  square  array  of  fibers.  This  solution,  however,  is  a 
complicated  function  of  the  fiber  volume  fraction  for  this  geometry  and  Tsay 
and  Weinbaum  [27]  show  that  the  following  simple  approximate  formula  for  kp 
as  a  function  of  A  is  accurate  to  within  ten  percent  for  all  volume  fractions  less 
than  0.7: 

A  2.377 

kp  =  0.0572a2(^-)  ,  (9) 

where  a0  is  the  fiber  radius.  A  rough  approximation  to  equation  (9)  is  that  kp  is 
approximately  equal  to  the  square  of  A;  thus,  since  the  Brinkman  equation 
shows  that  the  thickness  of  the  fiber-induced  boundary  layer  is  of  the  order 
it  follows  that  the  boundary  layer  thickness  is  approximately  equal  to  the  fiber 
spacing  parameter  A. 

9.  MODEL  PREDICTIONS 

The  model  described  above,  in  conjunction  with  the  theory  of  stress- 
induced  flow  of  interstitial  fluid  in  porous  solids  (Biot  theory),  is  used  to  relate 
the  externally  applied  cyclic  mechanical  loading  applied  to  the  whole  bone  to 
the  fluid  shear  stress  experienced  by  the  osteocytic  cell  process  in  a  bone 
canaliculus.  The  first  step  is  to  employ  the  Brinkman  equation  (6)  to  find  the 


fluid  velocity  in  a  canaliculus  through  the  fiber  net  representing  the  cell 
membrane  glycocalyx  of  cross-bridging  proteoglycan  fibers.  This  velocity  can 
then  be  used  to  relate  the  fluid  shear  stress  on  the  cell  process  to  the  pressure 
gradient  in  the  canaliculus.  The  pressure  gradient  in  the  canaliculus  is 
related  to  the  applied  mechanical  loading  by  the  Biot  theory  [28],  Biot  theory  has 
two  basic  assumptions:  (i)  Darcy's  law  (5)  governs  the  flow  of  the  interstitial 
fluid;  (ii),  the  matrix  material  of  the  porous  medium  follows  Hooke's  law.  The 
permeability  k  is  employed  in  the  Biot  theory.  It  is  calculated,  as  is  kP)eff,  from 
the  fluid  velocity  in  a  canaliculus  determined  above.  The  velocity  is  integrated 
over  the  cross  section  to  obtain  the  total  flux  Q  which,  in  turn,  is  used  in  the 
formulas  (7)  and  (8)  for  kp>eff  and  k,  respectively. 

Biot  theory  is  then  used  to  calculate  the  pore  fluid  pressure  relaxation 
time  from  the  expression  for  the  large  scale  permeability  k  and  known  data  on 
the  elastic  constants  of  bone.  This  relaxation  time  has  been  measured 
experimentally  and  shown  to  be  approximately  2  sec.  However,  for  the  typical 
values  of  a  and  b  of  the  inner  and  outer  radii  of  the  canaliculus,  0.087  and  0.20 

pm,  the  model  predicts  a  fluid  pressure  relaxation  time  of  the  order  10'2  sec., 
much  less  than  the  experimentally  measured  2  sec.,  if  the  proteoglycan  matrix 
were  not  present  in  the  fluid  annulus.  However,  when  the  annulus  is  filled 
with  a  matrix  whose  principal  component  is  the  GAG  side  chains  of  the 
proteoglycans,  fibers  with  a  typical  radius  of  typical  radius  of  a0  =  0.6nm,  one 
finds  for  the  same  values  of  b  and  a  that  a  pore  relaxation  time  of  2  sec.  can  be 
achieved  when  the  fiber  spacing  A  in  (9)  is  7  nm.  This  predicted  spacing 
corresponds  nearly  exactly  with  the  size  selective  matrix  structures  that  have 
been  observed  at  the  membrane  surface  of  endothelial  cells  in  many  tissues 
where  this  structure  serves  as  a  capillary  filter  for  plasma  proteins,  Michel 
[29],  It  is  of  particular  significance  that  bone  pore  fluid  is  a  plasma  derivative 
that  is  rich  in  albumin.  This  molecule  has  been  hypothesized  to  order  the  side 
chains  of  the  proteoglycans,  as  shown  in  Figure  9,  and  thus  determine  the 
characteristic  dimension  of  the  molecular  sieve,  for  virtually  all  capillary 
tissues. 

The  second  important  prediction  of  the  model  is  the  shear  stress  acting 
on  the  membranes  of  the  osteocytic  processes.  It  is  important  to  note  that  the 
theory  predicts  that  no  fluid  movement  will  occur,  and  thus  no  shear  stress, 
when  the  loads  applied  to  the  bone  are  steady.  The  results  we  report  are  for  the 
cyclic  loading  conditions  at  1  Hz.  For  a  matrix  with  the  above  fiber  spacing  one 
finds  that  for  whole  bone  subject  to  physiological  strains,  a  peak  shear  stress  of 
between  10  and  20  dyne/cm2  will  be  obtained,  the  maximum  value  depending  on 
the  ratio  of  the  applied  bending  to  axial  load.  These  shear  stresses  are  precisely 
in  the  same  range  as  the  fluid  shear  stresses  acting  on  endothelial  cells  where 
an  adaptive  remodeling  of  the  vasculature  has  been  observed.  From  geometric 
considerations  the  portions  of  the  membranes  of  the  osteocytes  that  lie  within 
the  lacunae  would  experience  considerably  lower  shear  stress  levels.  Although 
the  strain  is  mechanically  sensed  by  the  osteocytes,  it  can  be  communicated  to 
the  osteoblasts  and  bone  lining  cells  at  the  surface  of  the  trabeculae  by  changes 
in  ion  currents  through  cell  to  cell  membrane  ion  channels  in  the  gap 
junctions  at  the  apical  ends  of  the  osteocytic  process,  as  shown  in  Figure  8b. 

These  theoretical  predictions  have  important  implications  for  the 
possible  nature  of  the  excitation  process  that  control  stress-related  resorption 
and  deposition  of  bone  tissue.  The  results  suggest  that  the  same  fiber  matrix 
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microstructure  characteristic  of  other  filtration  tissues  exists  in  the  fluid 
sheath  surrounding  the  osteocytic  processes.  Theoretical  predictions  for  the 
fluid  shear  stress  on  the  membranes  of  the  osteocytic  processes  correspond 
closely  to  the  experimentally  measured  range  of  stresses  where  Ca++  has  been 
observed  to  be  released  from  intracellular  stores  in  shear  stress-activated 
endothelial  cells.  The  presence  of  Ca++  is  known  to  control  the  opening  and 
closing  of  the  ion  channels  in  the  gap  junctions,  thereby  changing  the  potential 
difference  between  cells.  Weinbaum  et  a £.[22]  propose  that  the  resulting  change 
in  potential  through  the  intracellular  pathway  is  the  electrical  stimulus  for  the 
activation  of  the  osteoblasts. 
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Abstract 

The  pulmonary  airways  are  lined  with  a  surfactant-rich  liquid  layer  which  thickens 
markedly  with  respiratory  disease.  The  dynamics  of  this  liquid  layer  have  been  investigated, 
primarily  through  numerical  simulation.  Two  specific  cases  were  examined:  (i)  When  the  layer 
is  sufficiently  thick,  a  fluid  dynamic  instability  may  occur,  causing  the  liquid  layer  to  form  a 
meniscus  that  obstructs  the  airway  lumen.  The  numerical  results  indicate  that  this  occurs  during 
deflation  at  approximately  25%  maximal  lung  volume  for  a  liquid  layer  initially  10  pm  thick  a;  ■ 
that  the  presence  of  surfactant  can  delay  or  even  prevent  this  process,  (ii)  When  a  bolus 
containing  exogenous  surfactant  is  instilled  '  *he  lung,  it  spreads  due  to  Marangoni  convection. 
The  influence  on  spreading  rate  of  the  characteristics  of  the  instilled  bolus  and  of  the  resident 
liquid  layer  have  been  examined. 


1.  INTRODUCTION 

When  you  exhale  as  far  as  possible,  a  volume  of  gas  remains  in  the  lung  -  usually  about 
25%  of  the  Total  Lung  Capacity  (TLC)  —  that  cannot  be  eliminated  no  matter  how  much  effort  is 
expended.  While  several  factors  contribute  to  this  observation,  the  reason  in  many  subjects, 
especially  those  with  various  forms  of  respiratory  disease,  is  that  there  is  a  thin  liquid  layer  that 
lines  the  airways  of  the  lung  from  the  gas  exchange  region  in  the  periphery  (the  "alveolar  zone") 
all  the  way  to  ti  e  trachea.  The  liquid  layer,  if  sufficiently  thick  relative  to  the  airway  radius,  can 
undergo  a  fluid  dynamic  instability  akin  to  the  Rayleigh  instability  of  a  liquid  jet;  a  small 
perturbation  in  film  thickness  grows  for  wavelengths  greater  than  2rca  where  a  is  the 
undisturbed  inner  radius  of  the  film  [11.  Moreover,  if  the  liquid  layer  is  thin  relative  to  the 
airway  radius,  this  instability  ultimately  leads  to  the  formation  of  an  unduloid  or  portions  of  an 
unduloid  separated  by  dry  regions  (2).  For  thicker  films,  as  might  be  expected  in  respiratory 
disease  or  at  lower  lung  volumes,  the  instability  leads  instead  to  the  formation  of  a  meniscus  that 
blocks  the  lumen  of  the  airway  producing  what  is  termed  "airway  closure"  [3].  This 
phenomenon  is  thought  to  be  responsible  for  the  observation  that  lungs  trap  gas  at  low  volume 
and  may  also  be  the  cause  of  an  increase  in  minimal  lung  volume  in  diseases  which  are 
associated  with  excess  airway  liquid.  In  such  diseases,  airway  closure,  if  it  persists,  can  lead  to 
gas  absorption  peripheral  to  the  meniscus  ("atelectasis"),  with  a  concomitant  impairment  in  gas 
exchange  capability. 

The  thickness  of  the  pulmonary  liquid  layer  varies  from  about  0.1  |im  in  the  smallest 
airspaces,  the  alveoli,  to  about  10  pm  in  the  trachea  in  a  normal,  healthy  lung  (41,  and  can 
increase  many-fold  in  respiratory  disease.  What  regulates  its  thickness  remains  largely 
unknown  although  the  epithelial  cells,  especially  those  in  the  alveolar  zone,  have  the  capability 
of  pumping  sodium  ion  out  of  the  airway  liquid,  thus  setting  up  an  osmotic  gradient  that  draws 
water  out  of  the  airway  lumen  and  into  the  surrounding  tissue  space,  the  interstitium  [5|.  In 
disease,  water  originating  in  the  pulmonary  capillaries  enters  the  interstitial  space  causing 
pulmonary  edema,  and  often  leaks  through  the  alveolar  epithelium  into  the  alveolar  airspace. 
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The  liquid  layer  has  many  functions,  among  which  are  the  transport  of  airborne  impurities 
toward  the  mouth,  the  protection  of  the  thin  layer  of  epithelial  cells  that  line  the  airways,  and  the 
humidification  of  the  air  that  you  breathe  in.  Although  largely  comprised  of  water,  the 
properties  of  this  fluid  are  complicated  by  the  presence  of  a  potent  surfactant  produced  by 
epithelial  cells  in  the  alveolar  zone,  of  mucus,  a  highly  viscous,  non-Newtonian  substance, 
introduced  by  goblet  cells  contained  in  the  lining  of  the  airway  walls,  and  of  proteins,  some  of 
which  are  needed  for  normal  surfactant  function  and  others,  such  as  those  originating  from  the 
blood  plasma,  which  act  to  impair  normal  function  of  surfactant.  Far  from  static,  this  layer  is  in 
constant  motion  due  in  part  to  the  action  of  ciliated  cells  along  the  airway  that  continually  drive 
the  liquid  toward  the  trachea.  Additional  factors  that  can  affect  the  motion  and  distribution  of 
the  airway  liquid  include  coughing,  transport  across  the  airway  epithelium,  gravity,  changes  in 
lung  volume,  and  interfacial  shear  due  to  normal  breathing. 

One  of  the  more  fascinating  yet  little-studied  characteristics  of  this  liquid  layer  is  the 
potential  for  flows  generated  by  surface-tension  gradients  (the  Marangoni  effect).  In  the  normal 
lung,  there  exists  a  standing  gradient  with  surface  tension  low  in  the  alveolar  zone  and  high  in 
the  central  airways  [6,7].  This  gradient,  in  combination  with  the  existence  of  a  contiguous 
liquid  layer,  must  tend  to  generate  a  steady  flow  of  liquid  toward  the  mouth  totally  apart  from 
flows  generated  by  the  cilia.  Such  naturally-occurring  Marangoni  flows  are  an  aspect  of  liquid 
transport  in  the  lung  that  has  never  been  subjected  to  systematic,  quantitative  analysis. 

The  role  of  pulmonary  surfactant  in  disease  is  widely  recognized.  For  example,  in  neonatal 
respiratory  distress  syndrome  (RDS)  the  primary  pathology  is  a  lack  of  normal  surfactant 
production.  The  currently  accepted  and  generally  successful  treatment  for  infants  suffering 
from  RDS  is  direct  instillation  of  an  exogenous  surfactant,  known  as  surfactant  replacement 
therapy  (SRT)  [8],  In  a  small  but  significant  subgroup,  however,  SRT  fails  for  reasons  that  are 
not  yet  fully  understood,  but  may  be  due  to  factors  that  influence  the  transport  of  exogenous 
surfactant  to  the  lung  periphery.  These  issues  have  led  to  the  recent  interest  in  transient 
Marangoni  flows  in  thin  films  [9,10,11].  The  observation  that  surfactant  is  efficiently  and 
rapidly  distributed  throughout  the  lung  has  also  led  to  the  notion  that  surfactants  might  be  used 
as  vehicles  for  drug  delivery  [12],  either  by  direct  instillation  of  a  bolus  or  via  an  aerosol. 

In  this  paper,  we  discuss  two  topics  related  to  the  dynamics  of  the  pulmonary  liquid  layer. 
The  first  is  airway  closure  which  we  simulate  for  the  smallest  non-alveolated  airways  (the 
terminal  bronchioles)  found  in  the  lung,  including  the  effects  on  closure  of  endogenous 
surfactant.  The  second  is  the  transient  distribution  of  a  bolus  of  exogenous  surfactant,  placed  in 
an  axi-symmetric  distribution  on  the  walls  of  a  tube  of  constant  diameter.  In  addition,  some 
qualitative  experimental  results  are  presented  for  airway  closure  in  a  large-scale  model. 


2.  METHOD  OF  ANALYSIS 


The  purpose  of  this  modeling  study  is  (i)  to  elucidate  the  time-scales  for  airway  closure  and 
the  lung  volumes  at  which  closure  occurs  for  a  given  liquid  layer  thickness,  and  (ii)  to  identify 
the  rate  of  spread  of  an  instilled  surfactant  monolayer.  For  the  analysis  of  these  problems  in  a 
thin  liquid  layer  lining  the  inside  of  a  tube  of  cylindrical  cross-section  (Fig.  l),we  employ  a 
numerical  method  solving  the  coupled  continuity,  surfactant  conservation  and  axial  momentum 
equations.  We  assume  that  (i)  the  surfactant  monolayer  is  insoluble  in  the  aqueous  sub-phase, 
(ii)  that,  for  cases  simulating  expiration,  the  model  airway  retains  a  fixed  geometry  as  its 
dimensions  decrease,  (iii)  the  aqueous  sub-phase  behaves  as  a  Newtonian  fluid,  (iv)  surface 
viscosity  can  be  neglected,  and  (v)  the  lining  is  thin  and  disturbances  are  of  long  wavelength. 
For  a  more  detailed  derivation  of  the  following  equations  see  [13]. 

Consistent  with  these  assumptions,  the  continuity  equation  can  be  expressed  as: 


dA  _ 
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Fig.  1.  Geometry  used  in  the  numerical  simulations,  (a)  Airway  closure  model,  (b)  Surfactant 
spreading  model. 

where  A  =  (s2-a2)/2  (all  terms  are  defined  in  the  nomenclature).  In  order  to  deal  with  problems 
in  which  the  airway  dimensions  change  with  time,  we  introduce  the  following  transformation: 


z 

Uf) 


(2) 


which,  when  introduced  to  eqn.  (1),  yields: 

Q  +  ld{.AZl+AL  =  0  (3) 

dt  L  dx  L 

where  y  is  the  mean  fluid  velocity  relative  to  the  moving  wall,  V  =  w  -  xL- 

Applying  a  similar  transformation  to  the  surfactant  conservation  equation  produces  the 
expression: 
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The  axial  momentum  equation,  with  gravity  oriented  parallel  to  the  tube  axis  is  written 
dw  dw  dw\  dp  u  d  (  <?w\ 

ir+“*+w*r-*+7*nrJ+p*  <6) 


the  solutions  of  which  must  satisfy  boundary  conditions  on  the  radial  velocity  . 
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axial  velocity  at  the  wall 
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and  the  velocity  gradient  at  the  interface 
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which  is  what  gives  rise  to  the  Marangoni  effect.  For  all  cases  discussed  below  we  assume 
symmetry  about  z  =  0  and  impose  the  condition  of  zero  axial  velocity  at  z  =  L. 

The  radial  momentum  equation  is  not  needed  since,  in  the  long-wavelength  approximation, 
the  radial  pressure  gradient  can  be  neglected  compared  with  its  longitudinal  variation. 
Therefore,  pressure  in  the  liquid  film  is  determined  by  Laplace's  law  applied  across  the  gas- 
liquid  interface,  used  in  the  form: 


where  gas  pressure,  pgas,  is  assumed  constant.  Combining  eqns.  (6)-(9)  and  integrating  the 
momentum  equation  across  the  liquid  layer  in  the  manner  of  Johnson  et  al.[3],  we  obtain 
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This  system  of  equations  is  solved  numerically  using  a  third-order  Adams-Bashforth  technique 
for  time-derivatives  and  central  differences  for  spatial  derivatives.  In  some  of  the  following 
calculations,  the  effects  of  axial  inertia  were  negligible,  resulting  in  a  somewhat  simpler  form  of 
the  equations.  In  addition,  although  surface  diffusion  is  included  in  some  of  the  fo’lowing 
results,  it  was  found  to  be  negligible  in  all  cases  when  realistic  numerical  values  are  used. 

For  simulation  of  the  lung  during  expiration,  all  linear  dimensions  are  taken  to  vary  as  the 
cube  root  of  lung  volume  [14],  which  is  assumed  to  decrease  linearly  with  time: 
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3.  EXPERIMENTS  ON  AIRWAY  CLOSURE 

To  provide  a  qualitative  illustration  of  airway  closure,  experiments  were  conducted  in  a 
large-scale  model  of  a  segment  of  the  airway  tree,  manufactured  by  a  computer-aided  milling 
process  by  Hammersly  and  Olson  [15].  Two  immiscible  liquids  were  used,  silicone  oil  (p  = 
9.7  dyne  s/cm2,  p  =  0.97  g/cm3)  and  a  water-ethanol  mixture  (88.4%  water,  1 1.6%  ethanol  by 
weight)  that  was  density-matched  to  minimize  buoyancy  effects.  The  water-alcohol  mixture 
was  used  to  simulate  the  relatively  inviscid  air  and  was  dyed  to  distinguish  it  from  the  clear 
silicone  oil  that  simulated  the  liquid  lining  in  the  lung. 

The  system  was  prepared  for  an  experiment  by  first  filling  the  entire  network  with  silicone 
oil,  then  slowly  adding  the  water-alcohol  mixture,  leaving  a  thin,  nearly  uniform  layer  of 
silicone  oil  on  the  walls  of  the  network.  The  initial  thickness  of  the  liquid  layer  was 
approximately  proportional  to  the  airway  diameter  for  each  generation  (ho/s  =  0.30+0.02)  except 
for  regions  of  near  zero  thickness  near  the  flow  divider  (carina)  of  each  bifurcation.  When  the 
entire  network  had  been  purged,  the  flow  was  stopped  and  a  sequence  of  photographs  were 
taken  during  the  time  it  took  for  all  of  the  branches  to  become  occluded  by  a  meniscus.  A 
selected  set  of  these  photographs  is  shown  in  Fig.  2a-d. 

Following  the  stoppage  of  flow,  the  uniform  annular  film  in  each  airway  initially  grew 
toward  an  unduloid-like  shape  and  then  finally  to  complete  closure  by  a  meniscus.  The  process 
began  slowly,  then  proceeded  more  and  more  rapidly  as  the  amplitude  of  the  disturbance  grew, 
taking  a  total  of  from  about  15  to  25  minutes.  The  final  stages  in  the  joining  of  the  opposing 
interfaces  occurred  on  a  time  scale  of  seconds  whereas  the  process  leading  up  to  this  event  took 
many  minutes.  By  25  minutes,  all  branches  were  occluded,  producing  a  collection  of  "dog- 
bone-shaped"  dark  regions. 

Despite  the  tendency  for  the  menisci  to  be  slightly  nearer  the  larger  neighboring  branch 
(parent  airway)  in  each  instance,  there  was  little  evidence  to  suggest  that  a  significant  quantity  of 
liquid  was  exchanged  between  branches  on  the  time  scale  of  film  collapse;  each  meniscus 
occupied  roughly  the  same  fractional  volume  of  the  branch  at  the  end  of  the  experiment.  This 
may  have  been  at  least  partly  a  result  of  the  tendency  for  wall  liquid  to  be  drawn  from  the  bifur- 
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Fig.  2.  Sequence  of  photographs  showing  the  progression  of  a  viscous  liquid  film  lining  the 
inside  walls  of  a  plexiglas  model  from  a  layer  with  ho/s  nearly  uniform  at  0.23  to  a  final  state 
with  all  airways  occluded  by  a  meniscus.  Photos  correspond  to  the  following  times,  referenced 
to  the  time  that  the  airway  shown  in  (a)-(c)  is  first  purged:  (a)  t  =  3:45  min.  (b)  t  =  22:35  min. 
(c)  t  =  23:05  min.  (d)  t  =  30:45  min.. 


cation  zones  toward  the  center  of  the  branch  as  the  instability  grew,  leaving  behind  a  narrowed 
layer  where  exchange  would  have  to  take  place.  In  every  instance,  the  wavelength  of  the 
instability  that  ultimately  led  to  closure  was  comparable  to  the  length  of  the  branch  itself. 


4.  PREDICTIONS  OF  AIRWAY  CLOSURE 

Airway  closure  was  modeled  for  a  lung  during  expiration  using  the  full  form  of  eqns.  (3), 
(4),  (8)  -  (12).  A  single  airway  was  considered,  having  the  idealized  geometry  depicted  in  Fig. 
la.  The  airway  size  and  liquid  layer  thickness  at  maximal  lung  volume  were  selected  to 
represent  typical  values  for  a  terminal  bronchiole  (see  caption  for  Fig.  4).  It  is  assumed  that  the 
airway  remains  geometrically  similar  and  that  no  liquid  escapes  from  the  ends  at  x  =  1,-1.  In 
addition,  the  equation  of  state  relating  surfactant  concentration  to  surface  tension  is  given  in  Fig. 
3,  taken  from  the  measurements  of  Schurch  [16], 

A  typical  result  is  shown  in  Fig.  4.  In  Fig.  4a,  the  thickness  of  the  liquid  layer  is  shown  as 
a  function  of  time,  demonstrating  the  transition  from  an  initially  uniform  layer  (perturbation 
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Fig.  3  Surfactant  equation  of  state  for  lung  surfactant  [16]  used  in  the  calculations  of  airway 
closure.  Minimum  surface  tension  is  taken  to  be  1  dyne/cm.  Surfactant  concentration  is  shown 
here  normalized  to  the  value  at  o  =  30  dyne/cm  which  corresponds  to  2.3x  10iS  molecules/m2 


amplitude  0.001  a o)  at  t  =  0,  through  the  growth  of  the  instability  and  ultimate  closure.  For 
times  less  than  about  400  msec  the  layer  remains  fairly  uniform,  but  thickens  due  to  the 
reduction  in  airway  size.  For  t  >  400  msec,  the  instability  rapidly  grows  and  liquid  is  drawn 
from  the  ends  of  the  tube  toward  the  center  until  at  t  =  436  msec,  the  protrusion  at  x  =  0  reaches 
the  centerline  and  closure  occurs.  It  should  be  noted  that,  in  all  these  simulations,  the 
approximations  made  in  the  analysis  break  down  in  the  final  stage  of  closure,  but  this  stage 
passes  so  quickly  that  it  has  relatively  little  influence  on  the  predicted  closure  times  (3]. 

For  the  case  shown,  the  liquid  film  is  at  first  stable  to  small  perturbations  and  the  initial 
disturbance  begins  to  relax  toward  a  cylindrical  surface.  By  the  time  lung  volume  falls  to 
0.89TLC,  however,  the  film  has  thickened  to  the  point  that  the  mean  inner  film  radius  is  less 
than  Lin  and  the  Rayleigh  instability  takes  over,  reversing  the  initial  tendency  toward  a  uniform 
film  and  generating  a  flow  from  the  ends  toward  a  growing  "bulge"  near  the  center.  Given 
sufficient  time  at  a  given  lung  volume,  this  instability  would  proceed  to  a  new  equilibrium 
configuration  (an  unduloid)  without  airway  closure.  However,  as  lung  volume  continues  to 
fall,  the  ratio  of  liquid  volume  to  tube  volume  increases  above  a  critical  value  of  0.297  (at  a  lung 
volume  of  0.26TLC)  and,  rather  than  forming  an  unduloid,  a  closed  meniscus  is  the  stable, 
minimum  surface  energy  state.  For  different  geometries  and/or  different  liquid  volumes,  this 
sequence  can  be  altered  and  the  critical  volume  condition  might  be  met  before  the  Rayleigh 
instability  causes  the  film  to  become  unstable;  in  that  case  it  is  never  possible  to  obtain  a  stable 
unduloid.  These  events  are  illustrated  in  the  stability  map  of  Fig.  5  where  a  reduction  in  lung 
volume  can  be  viewed  as  moving  downward  along  a  line  of  fixed  L/rcs.  For  case  1,  the  film 
begins  as  a  stable,  uniform-thickness  layer,  passes  through  a  region  in  which  the  stable 
configuration  is  an  unduloid,  and  ends  up  forming  a  liquid  bridge  as  a/s  falls  below  a  value  of 
about  0.8.  Case  2  is  one  in  which  the  only  instability  ultimately  leads  to  a  liquid  bridge,  by¬ 
passing  the  unduloid  state  entirely. 

The  pressure  distributions  shown  in  Fig.  4b  correspond  to  the  same  times  and  lung 
volumes  as  shown  in  4a.  The  interesting  feature  here  is  that  the  minimum  pressure  at  the  wall  is 
observed  at  maximal  lung  volume  when  the  radius  of  curvature  is  largest.  This  counter-intuitive 
result  is  due  to  the  presence  of  surfactant  at  concentrations  typical  of  those  found  in  the  lung. 
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Fig.  4.  Airway  closure  in  a  contracting  airway  with  variable  surface  tension.  At  TLC,  L  = 
0.075  cm,  s  =  0.025  cm,  a  =  0.024  cm.  Fluid  properties:  p  -  0.01  dyne-sec/cm2,  p  =  lg/cm2 
Expiration  rate  =  200%  TLC/sec.  (a)  Film  thickness  at  four  times  (in  msec),  (b)  pressure  in 
liquid  layer  at  same  times,  (c)  surfactant  concentration  at  same  times.  Closure  occurs  at  435 
msec. 


Fig.  5  Stability  map  for  the  forma* ion  of  an  unduloid  and  liquid  bridge. 


As  the  interface  contracts  during  expiration,  the  surfactant  concentration  rises  (see  Fig.  4c)  and 
the  surface  tension  consequently  falls.  With  sufficient  surfactant  present,  it  falls  so  rapidly  that 
a/a  actually  decreases  as  the  airway  contracts.  The  magnitude  of  the  liquid  pressure  falls  from 
an  initial  value  of  about  1250  dynes/cm3  to  about  100  dynes/cm3  and  remains  low  for  the 
duration  of  expiration.  The  gradients  in  surfactant  concentration  remain  relatively  small  until  the 
final  stages  of  closure,  but  are,  in  fact,  large  enough  to  effectively  "freeze"  the  interface  due  to 
the  Marangoni  effect.  Thus,  closure  is  delayed  by  pulmonary  surfactant  as  a  result  of  two 
mechanisms.  First,  the  reduction  in  surface  tension  as  the  gas-liquid  interface  becomes 
increasingly  compressed  during  expiration  brings  about  a  consequent  reduction  in  the  pressure 
gradient  driving  the  instability  for  a  given  displacement  amplitude.  This  driving  force  scales 
with  the  surface  tension  and  its  reduction  due  to  airway  contraction  is,  in  most  cases,  the 
dominant  influence.  Second,  the  presence  of  a  surface  tension  gradient  produces  a  shear  stress 
at  the  interface  (see  eqn.  (9))  that  effectively  reduces  the  interfacial  velocity  to  near  zero  values. 
By  itself,  this  slows  the  instability  by  a  factor  of  about  four  —  for  a  given  pressure  gradient  and 
fim  thickness,  the  flow  rate  between  two  stationary  surfaces  is  lower  by  a  factor  of  four  than 
that  between  one  stationary  and  one  free  surface.  As  long  as  the  surface  tension  remains  non¬ 
zero,  neither  mechanism  causes  an  unstable  condition  to  become  stable;  they  only  affect  the  roue 
at  which  the  instability  proceeds  and,  therefore,  can  influence  the  lung  volume  at  which  closure 
occurs  for  a  given  rate  of  expiration. 

Fig.  6  shows  the  results  from  many  calculations  of  the  type  just  described,  both  with  and 
without  surfactant  and  with  different  values  of  surface  tension.  The  two  limiting  cases  (high 
and  low  values  of  the  expiration  rate)  can  be  understood  as  follows.  In  the  quasi-static  limit  (far 
to  the  left  in  the  figure)  closure  occurs  as  soon  as  both  conditions  (L  >  Jta  and  V)iq/s3  >  5.6)  are 
first  satisfied  and  is  independent  of  the  rate  of  expiration.  In  the  limit  at  the  right,  the  airway 
contracts  so  rapidly  that  the  instability  has  no  time  to  develop  and  closure  occurs  essentially 
when  the  volume  of  liquid  equals  the  airway  volume.  In  between  these  limits,  a  competition 
exists  between  the  growth  of  the  instability  and  the  reduction  in  airway  volume.  Interestingly, 
the  case  which  is  thought  to  be  most  representative  of  the  real  lung  (labeled  case  IV  for 
expiration  rates  of  about  20%TLC/s)  fall  into  this  transition  range,  suggesting  that  surfactant 
does  indeed  function  to  reduce  the  closing  volume  of  the  lung  under  normal  conditions.  Note 


Fig.  6  Closing  volume  for  different  rates  of  expiration.  For  the  three  curves  labled  Case  III, 
the  surface  tension  is  held  constant  at  1, 20  or  30  dynes/cm.  Horizonatal  lines  show  limits  for 
slow  (upper)  and  fast  (lower)  expirations. 
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that  closing  volume  in  a  normal  lung  usually  occurs  at  a  lung  volume  of  about  3()%TLC, 
roughly  consistent  with  these  predictions.  Also  consistent  with  these  predictions,  lungs  with 
insufficient  surfactant  (as  in  respiratory  distress  syndrome)  exhibit  gas  trapping  and  atelectasis 
at  higher  than  normal  lung  volumes  and  often  require  mechanical  ventilation  with  high  airway 
pressures  to  maintain  patent  airways. 

Other  factors,  not  accounted  for  in  this  model,  may  also  be  important.  The  airways  are 
compliant  vessels,  the  stiffness  of  which  is  largely  determined  by  the  tethering  forces  exerted  on 
their  external  surface  by  the  surrounding  lung  tissue  (parenchyma).  This  tethering  force  scales 
with  the  inflating  pressure  of  the  lung  and  approaches  zero  for  lung  volumes  in  the  range  of 
15%>TLC  [17].  Although  the  precise  influence  of  airway  compliance  in  the  real  lung  is  not 
known,  the  effects  of  compliance  have  been  incorporated  into  a  model  of  airway  closure  [18J 
and  demonstrate  that  compliance  hastens  the  closure  process  and  would  thus  cause  it  to  occur  at 
somewhat  greater  lung  volumes. 


5.  SPREADING  OF  AN  EXOGENOUS  SURFACTANT 

The  geometry  used  to  study  the  axial  dispersion  of  an  instilled  bolus  of  exogenous 
surfactant  is  shown  in  Fig.  lb.  In  this  model,  a  linear  equation  of  state  of  the  form,  a  =  omax- 
aT,  is  employed,  where  amax  is  the  maximum  value  of  surface  tension.  The  surfactant  is 
deposited  in  an  axi-symmetric  distribution  between  x  =  0  and  1  at  time  t  =  0.  This  introduces  a 
gradient  in  surface  tension  which  draws  the  interface  to  the  right,  dragging  with  it  fluid  in  the 
sub-phase.  As  shown  in  Fig.  7,  liquid  in  the  vicinity  of  the  deposited  surfactant  moves  outward 
in  the  form  of  a  propagating,  shock-like  disturbance  of  nearly  constant  amplitude.  Over  most  of 
the  domain,  flow  is  generated  by  the  (nearly  linear)  surface  tension  gradient  (Marangoni 
convection)  with  capillarity  associated  with  surface  curvature  playing  an  insignificant  role.  At 
the  leading  edge  of  the  advancing  front,  however,  the  steep  gradient  in  film  thickness  gives  rise 
to  large  interfacial  curvatures  and  capillarity  becomes  the  dominant  factor  leading  to  pressure 
gradients  in  the  film  that  generate  the  observed  flow.  Surface  curvature  and  capillarity  also  play 
a  role  in  the  trailing  region  near  the  site  of  surfactant  deposition,  although  to  a  somewhat  lesser 
extent.  A  simple  scaling  analysis,  confirmed  by  more  extensive  numerical  simulations  [1 1], 
demonstrates  that  the  height  of  the  surface  film  will  approach  twice  the  initial  thickness  of  the 
liquid  layer  and  that  its  leading  edge  will  propagate  at  a  speed  that  scales  as  (aMhopU-2)1#. 


Fig.  7  Dimensionless  liquid  layer  thickness  (a)  and  surfactant  concentration  (normalized  by 
concentration  at  z/Lo  =0)  (b)  for  the  spreading  of  a  surfactant  from  an  initial  deposition  between 
z/Lq  =  0  and  1.  Parameter  values:  ho/Lo  =  0.01,  ho/s  =  0.01,  Hb„i  =  0.0,  Bo  =  0.0.  Curves 
are  for  dimensionless  times  tcmaxho/(p.Lo2)  =  0,  2,  10,  and  16. 


A  refined  discretization  from  60  to  240  nodes  dampens  the  waves  at  the  trailing  edge  of  the 
propagating  disturbance,  but  has  little  effect  on  those  at  the  leading  edge.  Further  refinement,  to 
480  nodes,  produces  no  further  change  leading  us  to  conclude  that  while  the  wave  structure  at 
the  back  is  a  numerical  artifact,  the  precursor  wave  structure  is  real  and  ts  presumably  associated 
with  the  dispersive  character  of  the  equation  set.  Subsequent  results  were  obtained  using  the 
coarser  grid  since  this  resolution  seems  adequate  to  capture  the  primary  spreading  behavior. 

The  parameters  of  this  problem  include  the  ratio  of  initial  film  height  to  airway  radius 
(ho/s),  the  ratio  of  initial  film  height  to  the  axial  extent  of  the  region  of  deposited  surfactant 
(ho/Lo),  the  initial  surface  tension  difference,  the  Bond  number  representing  the  ratio  of 
gravitational  to  surface  tension  effects,  and  the  amount  of  bulk  liquid  deposited  on  the  film  in  a 
bolus  along  with  the  surfactant,  characterized  by  the  parameter  Hhoi  as  defined  by  the  following 
expression  for  the  initial  film  thickness  distribution: 
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for  -1  <  z/Lo  <  1  and  h/ho  =  1  elsewhere.  Each  of  these  was  varied  independently  in  order  to 
ascertain  their  effect  on  spreading  behavior. 

Simulations  in  which  ho/s  was  varied  between  0.001  and  0.04  (the  range  of  values 
anticipated  in  the  lung)  showed  no  significant  differences  from  the  behavior  depicted  in  Fig.  7. 
Airway  curvature  therefore  plays  essentially  no  role  in  surfactant  spreading.  Consequently, 
models  in  which  the  effects  of  airway  curvature  are  neglected  would  seem  perfectly  appropriate 
for  most  cases. 

Variations  in  ho/Lo  proved  to  be  somewhat  more  significant  as  shown  in  Fig.  8.  When 
expressed  in  this  non-dimensional  format,  the  main  influence  of  ho/Lo  is  on  the  shape  of  the 
propagating  disturbance  rather  than  its  speed  of  propagation  with  larger  values  giving  rise  to  a 
generally  more  rounded  appearance  and  a  broader,  less  well-defined  shock  zone.  In 
dimensional  terms,  since  time  is  normalized  by  (iLoVlOmaxho)  the  rate  at  which  the  wave 
propagates  is  proportional  to  h(/L02;  thinner  films  or  larger  regions  of  surfactant  coverage  tend 
to  reduce  the  rate  of  spread. 


Fig.  8  Dimensionless  liquid  layer  thickness  (a)  and  surfactant  concentration  (b)  showing  the 
effect  of  changing  ho/Lo.  Parameter  values:  are  the  same  as  in  Fig.  7  except  ho/Lo  =  0.01. 
Curves  are  for  dimensionless  times  tomaxho/(|aLo2)  -  C,  2,  10,  and  16. 


Given  this  dependence  of  propagation  speed  on  film  thickness,  it  seemed  reasonable  to 
attempt  to  augment  the  rate  of  transport  by  introducing  the  surfactant  in  a  liquid  volume 
sufficient  to  significantly  increase  the  underlying  film  thickness.  However,  when  bolus 
volumes  were  increased  (Fig.  9),  we  found  that  the  rate  of  propagation  was  virtually  unaffected. 
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Fig.9  Dimensionless  liquid  layer  thickness  (a)  and  surfactant  concentration  (b)  showing  the 
effects  of  changing  bolus  volume.  Parameter  values  are  the  same  as  in  Fig.  7  except  Hboi  -  1 .0. 
Curves  are  for  dimensionless  times  tamaUiho/(gtL<r)  -  0,  2,  10,  and  16. 


Fig.  10  Dimensionless  liquid  layer  thickness  (a)  and  surfactant  concentration  (b)  showing  the 
effects  of  changing  bolus  volume.  Parameter  values  are  the  same  as  in  Fig.  7  except  for  Bo  = 
0.1.  Curves  are  for  dimensionless  times  tomaxho/(p.Lo2)  =  0,  2,  10,  and  16. 


Moreover,  the  additional  instilled  volume  appears  to  have  remained  in  a  central  pool  at  the  site  of 
deposition  which  was  proportionately  of  greater  depth  than  the  original  case  (compare  Figs.  7 
and  9). 

Gravity  was  also  considered  by  choosing  values  of  the  Bond  number  (Bo  =  0.001  to  0.1) 
corresponding  to  a  vertically  oriented  airway  ranging  in  size  from  a  terminal  bronchiole  to  the 
trachea.  Only  in  the  tracheal  simulation  (Bo  =  0.1)  did  gravity  exert  a  significant  influence, 
causing  the  spread  to  be  noticeably  asymmetric  as  shown  in  Fig.  10.  Even  in  this  extreme  case, 
the  rate  of  spread  of  the  surfactant  was  only  slightly  increased,  suggesting  that,  at  least  for 
relatively  small  volumes  of  deposited  surfactant,  gravity  plays  a  minor  role.  This  is  clearly  not 
the  case,  however,  in  experiments  that  have  been  reported  in  which  the  volume  of  the  instilled 
bolus  is  comparable  to  that  of  the  entire  central  airway  system  1 19]  in  which  case  other  factors, 
not  represented  in  these  simulations,  take  precedence. 

Lastly,  we  investigated  the  perhaps  more  realistic  situation  of  placing  an  exogenous 
surfactant  on  a  thin  film  containing  an  endogenous  surfactant  layer  (Fig.  11).  In  this  case,  the 


propagating  wave  compresses  the  interface  ahead  of  it,  producing  a  secondary  gradient  in 
surface  tension  due  to  increased  concentration  of  the  endo^etums  surfactant.  This  generates 
flow  away  from  the  deposition  site  and  toward  more  remote  locations.  By  this  mechanism,  the 
effect  of  the  instilled  surfactant  spreads  much  more  rapidly  (compare  Figs.  7  and  1 1 ). 
However,  because  the  height  of  the  disturbance  is  correspondingly  reduced,  the  amount  of  wall 
iiquid  transported  away  from  the  deposition  site  is  actually  less  than  when  there  is  no 
endogenous  surfactant.  This  finding  is  also  true  in  cases  when  the  volume  of  the  bolus  is 
increased  with  endogenous  surfactant  present  (results  not  shown). 


z!  Ln  z/L 


Fig.  1 1.  Dimensionless  liquid  layer  thickness  (a)  and  surfactant  concentration  (b)  showing  the 
effects  of  changing  bolus  volume.  Parameter  values  are  the  same  as  in  Fig.  7  except  for  the 
presence  of  surfactant  on  the  initial  interface.  Curves  are  for  dimensionless  times 
tcwhuApLo2)  =  0,  1,2  and  3. 


These  results  can  be  used  to  predict  the  rate  of  spread  of  a  bolus  of  surfactant  deposited  in 
the  trachea  of  a  newborn  infant.  In  the  absence  of  endogenous  surfactant,  and  using  the 
parameter  values  given  in  the  caption,  surfactant  would  tiaverse  the  entire  length  of  the  trachea 
(1.5  cm)  in  just  0.2  seconds.  If  we  neglect  the  influence  of  bifurcations,  the  action  of  other 
mechanisms  such  as  the  beating  of  the  cilia  in  propelling  liquid  toward  the  mouth,  and  the  rapid 
increase  in  surface  area  per  unit  length  of  airway  with  distance  toward  the  periphery,  we 
calculate  that  this  front  would  reach  the  alveolar  zone  in  about  7  seconds.  Such  transport  rates 
help  to  explain  why  surfactants  instilled  in  the  trachea  of  infants  produce  so  favorable  an  effect, 
and  also  argues  for  die  use  of  surfactants  in  the  delivery  of  other  medications  that  need  to  be 
drawn  into  the  lung. 


6.  CONCLUSION 

In  this  paper  we  have  examined  two  problems  pertaining  to  the  stability  and  mobility  of  the 
airway  liquid  Fning  as  a  means  of  raising  a  variety  of  intriguing  and  significant  fluid  dynamic 
questions.  As  the  lung  and  airway  network  expand  and  contract  during  normal  breathing  the 
thickness  of  the  liquid  layer  varies  and  can,  if  sufficiently  thick,  become  unstable  at  low  lung 
volumes  by  a  mechanism  analogous  to  the  Rayleigh  instability  of  a  liquid  jet  and  form  a 
meniscus  that  blocks  the  airway.  If  this  blockage  persists,  gas  absorption  peripheral  to  the 
meniscus  produces  regional  atelectasis  and  a  loss  of  gas  exchange  capability.  Since  excess 
airway  liquid  and  atelectasis  are  common  manifestations  of  respiratory  disease,  this 
phenomenon  may  be  critical.  If  atelectasis  is  to  be  avoided,  the  airways,  if  closed,  must  reopen 
on  inspiration.  This  raises  questions  concerning  the  movement  and  eventual  elimination  of  the 
liquid  meniscus. 
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A  related  set  of  fluid  dynamic  issues  is  raised  by  the  presence  of  pulmonary  surfactant 
which  is  produced  in  the  gas  exchange  region  of  the  lung  but  found  in  varying  amounts 
throughout  the  airway  tree.  In  lungs  with  normal  surfactant  production,  a  surface  tension 
gradient  exists  that  has  the  capability  of  drawing  liquid  from  the  periphery  toward  the  mouth  by 
the  action  of  surface  tension  gradients.  The  importance  of  pulmonary  surfactant  is  underscored 
by  the  prevalence  of  pulmonary  diseases  in  which  surfactant  production  is  impaired  such  as 
neonatal  or  adult  respiratory  distress  syndrome  (RDS).  Instillation  of  exogenous  surfactant  is 
now  the  accepted  treatment  for  neonatal  RDS  with  generally  excellent  results.  For  a  small  but 
significant  subgroup,  however,  this  therapy  is  ineffective.  Since  the  efficacy  of  this  method 
may  depend  on  how  effectively  surfactant  is  transported  to  the  lung  periphery,  studies  are  now 
underway  to  determine  how  an  exogenous  surfactant  spreads  in  the  lung.  The  spreading  is 
effected  by  Marangoni  flows,  raising  the  exciting  prospect  of  using  surfactants  as  a  vehicle  for 
transporting  other  medications  to  the  periphery  where  they  can  be  readily  absorbed. 

The  solutions  presented  here  address  but  a  few  of  the  intriguing  fluid  dynamic  problems 
relating  to  the  dynamics  of  the  pulmonary  liquid  lining  layer.  Studies  relating  to  the  formation, 
movement,  and  reopening  of  liquid  menisci  are  essential  to  further  our  understanding  of  these 
phenomena  in  healthy  and  diseased  lungs.  Important  questions  remain  concerning  the  factors 
that  regulate  the  distribution  of  liquid  between  the  vascular,  interstitial  and  airway 
compartments.  The  movement  of  airway  liquid  through  a  multiple-generation  network  due  to 
surface  tension  gradients,  either  due  to  endogenous  or  exogenous  surfactant  needs  further  study. 
In  particular,  the  potential  role  of  instilled  surfactant  as  a  vehicle  for  drug  delivery  warrants 
investigation. 
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NOMENCLATURE 


a 

A 

Bo 

Ds 

g 

h 

Lo 

L 

M 

P 

Pgas 

S 

t 

U 

TLC 

Texp 

va 

v 

w 

W 

x 


=  radius  to  free  surface,  cm 
=  (s2-  a2)/2,  cm2 
=  Bond  number,  pghoLo/a 
=  surfactant  surface  diffusion  coefficient,  cm2/s 
=  acceleration  due  to  gravity,  cm/s2 
=  liquid  layer  thickness,  (s-a),  cm 
=  initial  monolayer  half  length,  cm 
=  airway  half-length,  cm 
=  mass  of  surfactant  per  unit  circumference,  g/cm 
=  pressure  in  liquid  film,  dyne/cm2 
=  gas  pressure  in  airway,  dyne/cm2 
=  radius  to  airway  wall,  cm 
=  time,  s 

=  radial  velocity  of  liquid  layer,  cm/s 
=  total  lung  capacity 

=  time  in  which  the  lung  is  emptied  from  TLC,  s 
=  w_a  -  xL  =  surface  axial  flow  velocity  relative  to  airway  wall,  cm/s 
=  w  -  xL  =  mean  axial  flow  velocity  relative  to  airway  wall,  cm/s 
=  axial  velocity  of  airway  film,  cm/s 
=  mean  axial  velocity  of  airway  film,  cm/s 
=  z/L 


Q  -O  T=  P 
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2 


®max 


=  axial  coordinate  in  airway,  cm 

=  slope  of  the  linear  equation-of-state  for  a  vs.  r,  cm2/s2 
=  [1 +((l/L)3a/dx)2]-i/2 
=  dimensionlet,  >  surfactant  concentration 
=  viscosity  of  airway  liquid,  dyne  s/cm2 
*  density  of  airway  liquid,  g/cm3 
=  surface  tension,  dynes/cm 
=  maximum  value  for  a 


Subscripts 

a  ~  (r=a) 

s  ~  (r=s) 
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Abstract 

Animals  use  legs  and  flapping  wings  instead  of  the  wheels 
and  rotors  of  man-made  vehicles.  Some  largely  avoid  the 
energetic  penalties  of  using  reciprocating  instead  of  rotary 
motion  by  means  of  springs  which  store  the  kinetic  energy  that 
is  lost,  as  the  body  or  a  limb  decelerates,  and  return  it  in  an 
elastic  recoil.  Pendulum  mechanisms  involving  exchange  of 
kinetic  energy  with  gravitational  potential  energy  provide 
alternative  means  of  saving  energy  at  low  speeds. 


1.  INTRODUCTION 

Energy  saving  is  important  to  animals,  in  many  ways.  An 
animal  may  be  unable  to  obtain  more  than  a  limited  amount  of 
food  (and  so  energy)  each  day.  Alternatively,  more  food  may  be 
available  only  at  the  cost  of  taking  risks,  or  of  having  less 
time  available  for  other  activities  such  as  reproduction.  An 
animal  that  moves  economically  may  be  able  to  run  faster  or 
further  for  the  same  energy  cost,  so  may  be  better  at  catching 
prey  or  escaping  from  enemies.  The  structures  and  patterns  of 
movement  on  which  economy  of  energy  depends  are  largely 
inherited,  so  can  be  modified  by  evolution  -  other  things  being 
equal,  animals  that  are  more  economical  of  energy  are  more 
likely  to  survive  and  have  offspring.  This  paper  shows  some  of 
the  many  energy-saving  devices  that  have  evolved  in  animals. 

Unfortunately,  we  have  to  start  with  an  awkward 
complication.  It  is  often  relatively  easy  to  predict  from 
engineering  mechanics  how  a  change  in  structure  or  pattern  of 
movement  will  affect  the  mechanical  power  needed  for  locomotion. 
However,  the  costs  that  drive  evolution  are  not  work,  but  fuel 
(food)  consumption.  We  want  to  know  the  effects  of  changes  not 
on  mechanical  power  but  on  metabolic  power,  the  rate  at  which 
food  energy  (measured  as  heat  of  combustion)  is  used. 

We  might  hope  to  be  able  to  calculate  metabolic  from 
mechanical  power  by  dividing  by  some  constant  efficiency,  but 
that  approach  would  lead,  in  some  cases,  to  serious  errors. 
Metabolic  energy  must  of  course  be  used  when  muscles  do  work 
(conservation  of  energy  requires  it)  but  it  is  also  used  when 
a  muscle  exerts  tension  while  maintaining  constant  length  and 
so  doing  no  work.  Muscle  consists  of  an  array  of  inter- 
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digitating  protein  filaments  that  can  slide  past  each  other, 
allowing  it  to  lengthen  and  shorten  like  a  telescope.  It  is 
made  to  shorten  by  crossbridges  between  the  filaments  which 
attach,  pull  and  detach  like  a  team  of  men  pulling  in  a  rope 
hand-over-hand.  These  cycles  of  attachment  and  detachment 
continue  as  long  as  tension  is  being  exerted,  whether  the  muscle 
is  shortening  or  not,  and  each  cycle  costs  energy. 

The  relationship  between  mechanical  performance  and 
metabolic  energy  cost  is  imperfectly  understood  (Ma  &  Zahalak, 
1991;  Alexander,  1991),  but  to  a  first  approximation  we  can 
think  of  metabolic  cost  as  the  sum  of  two  components: 

(i)  the  cost  of  work,  proportional  to  the  work  performed;  and 

(ii)  the  cost  of  force,  proportional  to  the  number  of  active 
crossbridges  multiplied  by  the  number  of  cycles  made  by  each  of 
them.  The  number  of  crossbridges  is  proportional  to  the  volume 
of  muscle  that  has  to  be  activated  to  exert  the  force,  and  the 
rate  at  which  they  cycle  is  higher  in  muscles  that  can  shorten 
fast  than  in  muscles  that  can  contract  only  slowly.  Thus 
evolution  can  be  expected  to  favour  changes  that  reduce  the  work 
that  muscles  have  to  do,  that  reduce  the  volume  of  muscle 
involved,  or  that  enable  the  animal  to  use  slower,  more 
economical  muscles. 


2.  RUNNING 

The  energy  cost  of  running  is  particularly  interesting  for 
the  paradoxical  reason  that  very  little  net  work  is  needed,  to 
run  at  constant  speed  over  level  ground.  Work  is  of  course 
required  to  overcome  aerodynamic  drag,  and  friction  in  the 
joints,  but  the  quantities  are  only  tiny  fractions  of  the 
metabolic  energy  requirement  (Alexander,  1977).  Additional 
energy  costs  arise  because  animals  travel  on  legs  rather  than 
wheels.  It  is  difficult  to  imagine  how  wheels  could  have 
evolved  but  La  Barbera  (1983)  has  argued  that  legs  are  in  any 
case  preferable  largely  because  of  their  capability  of  travel 
over  irreqular  terrain. 

To  understand  the  additional  costs  of  travel  on  legs  we 
must  take  account  not  only  of  the  positive  work  that  muscles  do 
when  they  shorten  while  exerting  force,  but  also  of  the  negative 
work  done  when  they  are  forcibly  lengthened,  degrading 
mechanical  energy  to  heat.  One  muscle  in  a  leg  may  do  positive 
work  while  another  does  negative  work.  Alternatively,  muscles 
may  do  positive  work  at  one  stage  of  the  stride  and  negative 
work  at  another.  In  either  case,  the  sum  of  all  the  quantities 
of  positive  work  performed  in  a  journey  may  greatly  exceed  the 
net  work  requirement.  Similarly,  the  fuel  consumption  of  your 
car  is  increased  if  you  drive  with  the  brakes  on. 

The  most  obvious  additional  cost  of  travel  on  legs  arises 
because  wheels  rotate  but  legs  reciprocate.  The  kinetic  energy 
of  an  animal  (or  of  any  other  system  of  masses)  can  be  thought 
of  as  the  sum  of  external  kinetic  energy  associated  with  the 


movement  of  the  centre  of  mass  and  internal  kinetic  energy 
associated  with  movement  of  parts  relative  to  the  centre  of  mass 
(see,  for  example,  Alexander,  1988).  The  internal  kinetic 
energy  of  a  wheeled  vehicle  may  be  constant,  when  it  travels  at 
constant  speed,  but  that  of  a  legged  animal  rises  and  falls  as 
its  legs  swing  backward  and  forward.  Positive  work  must  be  done 
on  each  leg  twice  in  each  stride,  and  negative  work  must  be  done 
twice,  to  keep  it  swinging.  The  quantities  involved  can  be 
reduced  by  reducing  the  moment  of  inertia  of  the  limb  about  its 
proximal  end.  Specialised  running  animals  such  as  horses  and 
antelopes  have  large  muscles  in  the  upper  parts  of  their  legs, 
where  their  mass  contributes  little  to  the  moment  of  inertia, 
while  their  feet  are  slender  and  correspondingly  light. 

A  pendulum  can  be  kept  swinging  with  very  little  energy 
input,  because  potential  energy  changes  compensate  for  the 
fluctuations  of  kinetic  energy.  Legs  may  swing  forward 
passively  in  walking  (Mochon  &  McMahon,  1980;  McGeer,  1992)  but 
at  higher  speeds  they  have  to  be  driven  well  above  their 
pendulum  frequency.  We  will  examine  later  the  possibility  that 
legs  may  be  spring -mounted,  so  as  to  vibrate  at  the  appropriate 
frequency,  but  it  will  be  convenient  to  leave  that  until  after 
discussing  other,  more  firmly  established,  elastic  mechanisms. 

We  have  been  discussing  energy  costs  that  arise  because 
legs  have  mass  and  make  reciprocating  movements.  We  turn  now 
to  other  energy  costs.  Fig.  1  shows  some  simple  wolking  devices 
which  will  be  assumed  in  the  discussion  that  follows,  to  have 
legs  of  zero  mass. 

In  Fig.  1(a)  each  leg  has  two  joints,  powered  by 
appropriate  actuators  :  a  hinge  joint  at  the  proximal  end,  which 
enables  the  leg  to  be  pointed  in  any  required  direction  within 
a  plane,  and  a  sliding  joint  which  allows  changes  of  leg  length. 
It  might  have  seemed  more  realistic  to  have  drawn  a  leg  with  two 
hinges  (a  "hip"  and  "knee")  but  the  arrangement  shown  allows 
the  same  number  of  degrees  of  freedom  of  movement.  Suppose 
that  this  device  walks  over  level  ground  keeping  the  forces  on 
its  feet  vertical  and  equal  to  body  weight.  Suppose  also  that 
the  moment  of  inertia  of  its  body  is  so  high  that  the  amplitude 
of  pitching  movements  is  negligible.  In  that  case,  kinetic  and 
potential  energy  will  remain  constant  throughout  the  stride  and 
no  net  work  will  be  required  at  any  stage,  except  to  replace 
frictional  losses.  However,  at  the  stage  shown  in  Fig.  1(a) (i) 
the  hinge  actuator  is  doing  positive  work  and  the  sliding  joint 
actuator  negative  work;  while  at  the  stage  shown  in  Fig. 
1(a)  (ii)  the  reverse  is  the  case.  Similarly,  if  a  leg  with  hip 
and  knee  joints  walked  in  this  way,  the  actuator  of  one  joint 
would  have  to  do  work  against  the  actuator  of  the  other ,  during 
much  of  the  stride. 

This  wastage  of  energy  could  be  avoided  by  having  legs  with 
two  sliding  joints  as  shown  in  Fig.  1(b)  ,  or  by  more  complicated 
pantograph  mechanisms  such  as  have  been  used  in  some  legged 
robots  (Vlaldon  et  al.  ,  1984),  but  I  know  no  cases  of  animals 
saving  energy  in  this  way. 


Fag.  1  Diagrams  of  simple  walking  devices 


Another  walking  style  that  avoids  having  actuators  working 
against  each  other  is  shown  in  Fig.  1(c).  The  torque  actuator 
at  the  proximal  end  of  the  limb  remains  inactive,  so  the  ground 
force  is  aligned  with  the  leg.  The  sliding  joint  actuator  does 
negative  work  at  the  stage  of  the  stride  shown  in  Fig.  1(c) (i), 
reducing  the  kinetic  energy  of  the  body;  and  positive  work  at 
the  stage  shown  in  Fig.  1(c) (ii),  restoring  the  kinetic  energy. 
Thus  positive  work  done  at  one  stage  is  counteracted  by  negative 
work  done  at  another,  but  it  is  easily  shown  that  the 
quantities  of  work  are  less  than  in  the  style  of  walking  shown 
in  Fig.  1(a),  for  all  reasonable  step  lengths.  Alexander  ( 1 9 >1 ) 
shows  similarly  for  a  leg  with  two  hinge  joints,  that  positive 
and  negative  work  requirements  can  be  minimized  by  aligning  the 
ground  force  with  the  proximal  joint.  The  forces  exerted  on  the 
ground  in  walking  and  running  by  dogs,  sheep  and  other 
vertebrates  tend  to  be  aligned  with  the  proximal  limb  joints, 
or  with  points  above  them  (Jayes  &  Alexander,  1978). 

Even  this  reduced  work  requirement  can  be  eliminated 
(leaving  only  the  work  needed  to  replace  frictional  losses)  if 
the  actuator  of  the  sliding  joint  is  replaced  by  a  spring  :  the 
theory  is  presented  by  McMahon  &  Cheng  (1990).  A  vehicle 
bouncing  along  on  sprung  legs  must  rise  and  fall,  accelerate  and 
decelerate,  but  the  fluctuations  of  kinetic  energy  plus 
gravitational  potential  energy  are  balanced  by  fluctuations  of 
elastic  strain  energy. 

Some  of  the  muscles  in  the  legs  of  mammals  have  long 
tendons  in  series  with  them.  These  serve  as  springs,  and  have 


excellent  elastic  properties  :  Bennett  et  al . ,  (1986)  show  that 
when  tendons  are  stretched  to  physiologically  relevant  stresses, 
and  allowed  to  recoil,  percentage  energy  dissipation  is  no  more 
than  about  7%.  The  forces  on  leg  tendons  and  the  extent  to 
which  they  stretch  in  running,  have  been  calculated  for  many 
mammals  (Alexander,  1988)  but  the  most  thorough  studies  are  of 
kangaroos,  most  recently  by  Griffiths  (1989).  He  measured  the 
force  exerted  by  the  gastrocnemius  (calf)  muscle  and  its  changes 
in  length,  as  the  animal  hopped.  He  showed  that  it  stretched 
by  about  12  mm  as  the  force  increased  and  shortened  as  the  force 
fell,  as  if  it  were  a  passive  spring.  He  also  measured  the 
elastic  compliance  of  its  tendon  and  showed  that  41%  of  the 
negative  and  positive  work  was  attributable  to  passive 
stretching  and  recoil  of  the  tendon,  leaving  only  59%  of  the 
work  to  be  done  by  the  muscle  fibres. 


Fig.  2  The  force  on  the  foot  Fig.  3  At  this  stage 

of  a  human  runner,  calculated  of  a  galloping  stride, 

from  force  plate  records.  all  four  legs  change 

their  direction  of 
movement  and  energy 
can  be  saved  by 
elastic  extension  and 
recoil  of  a  sheet  of 
tendon  in  the  back. 

The  ligaments  in  the  arch  of  the  human  foot  form  an 
additional  spring.  Fig.  2  shows  a  foot  at  the  stage  of  a 
running  stride  at  which  the  forces  are  greatest.  The  ground 
force  and  its  point  of  application  (on  the  ball  of  the  foot)  are 
known  from  force  plate  records.  The  force  in  the  Achilles 
tendon  and  the  reaction  at  the  ankle  joint  have  been  calculated. 
These  three  forces  flatten  the  arch,  storing  strain  energy  in 
the  ligaments.  Experiments  with  amputated  feet  (Ker  et  al .  , 
1987)  show  that  the  arch  is  a  reasonably  good  spring,  with 
about  20%  energy  dissipation,  and  enable  us  to  calculate  the 
strain  energy  stored  in  it.  It  seems  that  of  the  (kinetic  plus 
potential)  energy  lost  and  regained  in  each  step,  about  35%  is 
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stored  as  elastic  strain  energy  in  the  Achilles  tendon  and  about 
17%  in  the  arch  of  the  foot-  Thus  half  of  the  negative  and 
positive  work  requirement  is  accommodated  by  deformation  and 
recoil  of  springs,  leaving  only  half  to  be  supplied  (at 
metabolic  energy  cost)  by  muscles.  The  muscles  that  attach  to 
the  human  Achilles  tendon  have  fibres  about  50  mm  long,  but 
muscles  associated  with  tendon  springs  in  more  specialized 
running  mammals  have  much  shorter  fibres.  For  example  one  of 
the  digital  flexor  muscles  in  the  fore  leg  of  tne  horse  has 
muscle  fibres  only  about  3  mm  long  attached  to  a  7  00  mm  tendon. 
Analysis  of  films  of  galloping  horses  shows  that  this  muscle- 
plus-tendon  lengthens  and  shortens  by  about  50  mm  in  each 
stride  (Dimery,  Alexander  &  Ker,  1986).  The  vestigial  muscle 
fibres  can  contribute  very  little  to  this  length  change,  which 
must  be  almost  entirely  due  to  passive  stretching  and  recoil  of 
the  tendon.  The  advantage  of  the  reduced  size  of  the  muscle 
fibres  is  presumably  that  very  little  metabolic  energy  is  needed 
to  develop  tension  in  them. 

An  alternative  means  of  saving  energy  is  to  balance 
kinetic  energy  fluctuations  against  gravitational  potential 
energy  fluctuations.  In  Fig.  1(d)  the  length  of  the  leg  is  kept 
constant  and  the  ground  force  is  kept  aligned  with  the  leg.  At 
the  illustrated  stages  of  the  stride,  kinetic  and  potential 
energy  changes  are  balanced  and  no  work  is  required  of  either 
actuator.  However,  work  is  required  when  one  foot  is  lifted  and 
the  other  set  down  (McGeer,  1992).  Alexander  (1993)  has  shown 
that  gaits  involving  exchange  of  kinetic  with  gravitational 
potential  energy  can  be  economical  at  low  speeds,  and  that  human 
walking  is  close  to  making  the  best  possible  use  of  this 
principle. 

I  return  now  to  the  kinetic  energy  changes  associated  with 
forward  and  backward  swinging  of  the  legs.  When  quadrupeds 
trot,  each  fore  leg  moves  in  synchrony  with  the  diagonally 
opposite  hind  leg.  At  higher  speeds  mammals  gallop,  swinging 
the  fore  legs  back  while  the  hind  legs  swing  forward  and  vice 
versa.  Fig.  3  shows  a  stage  of  the  strike  at  which  all  four 
legs  change  their  directions  of  movement  and  internal  kinetic 
energy  passes  through  a  minimum.  Alexander,  Dimery  &  Ker  (1985) 
have  shown  that  much  of  the  lost  kinetic  energy  may  be  stored 
as  elastic  strain  energy  in  a  sheet  of  tendon  in  the  back, 
saving  metabolic  energy.  There  is  another  stage  of  the  stride 
at  which  the  legs  are  at  the  opposite  extremes  of  their  ranges 
of  movement,  at  which  it  would  again  be  useful  to  store  kinetic 
energy  taken  from  them,  as  elastic  strain  energy.  Bennett 
(1989)  has  identified  a  sheet  of  tendon  in  the  thigh  as  a 
possible  store  for  some  of  this  energy. 


3 .  FLIGHT 

When  insects,  small  birds  and  bats  hover  they  beat  their 
wings  back  and  forward  in  a  near-horizontal  plane,  driving  air 
downwards  in  the  manner  of  a  helicopter  rotor.  When  they  fly 


fast  they  beat  their  wings  up  and  down,  adjusting  their  angle 
of  attack  in  such  a  way  as  to  supply  not  only  the  upward  force 
needed  to  counteract  the  body's  weight  but  also  the  thrust 
needed  to  overcome  drag.  In  either  case,  we  have  a 
reciprocating  motion  :  kinetic  energy  must  be  supplied  to  the 
wings  twice  in  each  wing  beat  cycle,  and  removed  twice.  This 
may  add  little  or  nothing  to  the  work  required  of  the  muscles, 
if  the  inertial  work  required  to  accelerate  the  wings  is  small 
compared  to  the  aerodynamic  work  required  for  flight.  Work  done 
to  accelerate  the  wing  at  the  beginning  of  a  stroke  may  be 
recovered  as  useful  work  done  against  the  aerodynamic  forces 
that  decelerate  it  at  the  end  of  the  stroke.  However,  if  the 
inertial  work  requirement  is  large,  some  of  it  may  have  to  be 
dissipated  by  muscles  doing  negative  work  at  the  end  of  the 
stroke. 

The  inertial  work  is  relatively  large  in  slow  and  hovering 
flight.  Ellington  (1984)  calculated  for  hovering  by  various 
insects  that  the  ratio  of  inertial  to  aerodynamic  work  lay 
between  0.7  and  3.  In  such  cases,  worthwhile  energy  savings  can 
be  made  by  elastic  mechanisms.  This  could  be  done  by  placing 
springs  of  appropriate  stiffness  either  in  parallel  or  in  series 
with  the  muscles.  Springs  in  parallel  would  reduce  the  forces 
required  of  the  muscles  without  affecting  their  length  changes 
while  springs  in  series  would  do  the  reverse.  Bennett,  Ker  & 
Alexander  (1987)  show  that  the  two  cases  require  springs  of 
different  stiffness.  Weis-Fogh  (1960)  described  parallel 
springs  in  the  wing  hinges  of  locusts  and  a  series  spring  on  a 
smaxx  muscle  in  dragonflies.  However,  in  advanced  insects 
(bees,  flies,  bugs,  beetles  etc.)  the  muscles  themselves  serve 
as  springs.  This  was  demonstrated  by  Machin  &  Pringle  (1959), 
who  showed  that  muscles  from  such  insects,  mounted  in  apparatus 
that  simulated  the  loads  that  would  act  on  them  in  flight,  could 
be  made  to  behave  like  springs  with  negative  hysteresis  : 
instead  of  dissipating  energy  in  each  cycle  of  extension  and 
shortening,  they  did  work  in  each  cycle. 

The  cross-bridges  seem  to  be  the  springs  in  these  insect 
muscles.  In  Machin  &  Pringle's  (1959)  experiments  the  muscle 
fibres  were  made  to  lengthen  and  shorten  by  about  5%  (peak  to 
peak) .  Such  small  length  changes  might  be  accommodated  by 
stretching  of  cross-bridges,  without  detachment  :  when  a  cross¬ 
bridge  detaches  any  strain  energy  stored  in  it  is  lost.  The 
muscles  are  believed  to  work  over  similarly  small  length  ranges 
in  the  intact  flying  insect.  Because  the  range  is  small,  a 
large  volume  of  muscle  has  to  be  activated  to  supply  the  work 
for  each  wing  beat,  so  it  may  seem  doubtful  whether  metabolic 
energy  is  saved  in  comparison  with  possible  alternative 
arrangements  that  would  not  exploit  muscle  elasticity. 

The  wing  muscles  of  birds  are  arranged  differently  from 
those  of  the  insects  we  have  been  discussing,  with  much  large 
velocity  ratios.  Consequently,  they  have  to  make  length  changes 
of  about  20%  peak  to  peak  (Cutts,  1986)  and  there  is  little 
scope  for  exploiting  the  elastic  properties  of  cross  bridges. 
Pennycuick  &  Lock  (1976)  suggested  that  useful  energy  savings 


might  be  made  by  storage  of  strain  energy  in  bent  feathers,  but 
Alexander  (1988)  argued  that  the  mechanism  could  not  work  when 
it  would  be  most  useful,  in  hovering.  Jenkins,  Dial  &  Goslow 
(1988)  Look  X-ray  cine  film  of  a  starling  flying  in  a  wind 
tunnel  and  showed  that  the  width  between  the  shoulders  increases 
and  decreases  in  each  wing  beat  cycle.  They  remarked  that  the 
furcula  (wishbone)  was  behaving  like  a  spring,  but  it  is  easy 
to  show  from  data  in  their  paper  that  the  strain  energy  stored 
in  it  is  only  about  5%  of  the  inertial  work  required  to 
accelerate  the  wings.  So  far,  no  spring  has  been  identified 
that  could  have  an  important  energy-saving  role  in  bird  flight. 


4 .  SWIMMING 

Fishes  swim  by  beating  their  tails  from  side  to  side,  and 
whales  beat  their  tail  flukes  up  and  down.  Inertial  work  must 
be  done  at  the  beginning  of  each  stroke,  and  there  may  be  scope 
for  energy  saving  by  elastic  storage.  Bennett,  Ker  &  Alexander 
(1987)  thought  that  the  long  tendons  in  the  tails  of  dolphins 
might  serve  as  in-series  springs,  but  their  (admittedly  very 
rough)  calculations  indicated  that  they  were  too  compliant  to 
save  energy.  Wainwright,  Vosburgh  &  Hebrank  (19'78)  suggested 
that  the  mesh  of  helically-wound  collagen  fibres  in  fish  skin 
might  be  a  useful  spring,  but  its  geometry  makes  it  difficult 
to  estimate  how  much  strain  energy  is  stored  (Alexander,  1987) 
and  the  possibility  remains  open. 
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Abstract 

Through  the  use  of  a  feedback  control  strategy,  one  can  profoundly  alter  the 
bifurcation  structure  of  convective  flows.  For  instance,  with  the  use  of  a  feedback 
controller,  one  can  maintain  the  no-motion  state  in  fluid  layers  heated  from  below 
and  cooled  from  above  at  significantly  higher  temperature  differences  than  the 
critical  one  corresponding  to  the  onset  of  convection  in  an  uncontrolled  system. 
Likewise,  one  can  maintain  steady,  time-independent  flow  under  conditions  in 
which  the  flow  would  otherwise  be  chaotic.  Similarly,  one  can  stabilize  periodic 
non-stable  orbits  which  exist  in  the  chaotic  regime  of  the  uncontrolled  system. 
Finally,  the  controller  also  can  be  used  to  destabilize  otherwise  stable  flows  and 
thus  induce  chaos  in  laminar,  non-chaotic  flow.  In  this  paper,  I  will  discuss  the 
progress  that  has  been  made  in  achieving  the  foregoing  in  the  convective  loop  and 
the  saturated  porous  layer  paradigms. 

1.  INTRODUCTION 

In  recent  years,  active  control  strategies  have  been  successfully  used  for, 
among  other  things,  noise  reduction  and  vibration  suppression.  The  area  of  active 
control  of  fluid  flow  is  no  less  important  from  a  technological  point  of  view;  for  in 
many  situations,  it  may  be  desirable  to  maintain  flow  conditions  other  than  the 
naturally  occurring  ones.  Below,  I  mention  a  few  examples. 

In  some  situations,  convection  suppression  may  be  desirable.  For  instance, 
when  a  crystal  is  pulled  from  a  liquid  melt  heated  from  below,  it  may  be  desirable  to 
operate  at  Rayleigh  numbers  higher  than  the  critical  one  at  which  convection 
normally  begins  and  yet  have  no  convection  so  as  to  minimize  transport  of 
contaminants  from  the  crucible  walls  to  the  crystal. 

In  other  situations,  suppression  (laminarization)  of  chaotic  or  turbulent 
motion  and  maintenance  of  a  steady,  time-independent  flow  may  be  desirable  in 
order  to  minimize  flow  unpredictability,  remove  temperature  oscillations  which  may 
exceed  safe  operational  conditions,  and/or  reduce  drag.  For  instance,  stabilizing 
the  Blasius  profile  and  delaying  the  laminar-turbulent  transition  in  the  boundary 
layer  flow  may  provide  a  means  for  reducing  aircraft  drag  and  thus  obtaining 
significant  savings  in  fuel  cost.  Likewise,  suppressing  oscillatory  convection  in 
crystal  growth  processes  may  lead  to  an  improvement  in  crystal  quality. 

In  yet  other  situations,  it  may  be  desirable  to  stabilize  one  of  the  time- 
periodic  orbits  embedded  in  the  turbulent  attractor  in  order  to  achieve  a  desired 


188 


flow  behavior.  Indeed,  from  the  purely  scientific  point  of  view,  the  ability  to  stabilize 
and  observe  various  non-stable,  periodic  orbits  embedded  within  the  chaotic 
attractor  may  help  enhance  our  understanding  of  the  chaotic  dynamics  and  assist 
us  in  describing  various  important  properties  of  the  chaotic  attractor. 

In  still  other  situations,  it  may  be  advantageous  to  induce  chaos,  under 
conditions  in  which  it  would  not  normally  occur,  so  as  to  enhance  mixing,  heat 
transport  or  chemical  reactions  in  combustion  chambers  or  chemical  reactors. 

Surprisingly  given  all  the  foregoing,  the  idea  of  altering  the  bifurcation 
structure  of  flows  through  the  use  of  active  control  has  received  relatively  little 
attention.  Most  studies  to  date  have  focused  on  passive  control  methods. 

Passive  control  strategies  do  not  require  a  direct  investment  of  energy  and 
they  are  usually  accomplished  by  changing  the  geometry  of  the  solid  boundaries. 
For  example,  mixing  can  be  enhanced  through  the  use  of  pipes  with  sinusoidally 
modulated  walls  rather  than  straight  pipes  or  by  introducing  spoilers  in  the  flow. 
Boundary  layer  transition  to  turbulence  can  be  delayed  by  using  perforated  solid 
walls  to  facilitate  suction  (Bushnell  and  McGinley,  1989);  and  thermal  convection 
can  be  suppressed  by  introducing  buffers  in  cavities.  Passive  devices  may  have 
indirect  energy  effects  as  their  presence  may  affect  the  amount  of  energy  needed  to 
propel  the  fluid  or  to  propel  an  object  submerged  in  the  fluid. 

In  contrast,  active  control  strategies  require  a  direct  investment  of  energy  for 
actuation  purposes.  Examples  of  active  control  include  inducement  of  periodic 
motion  of  the  solid  boundaries,  suction  and/or  blowing,  induction  of  acoustic 
pressure  waves,  and  heating  and  cooling  of  the  fluid  to  affect  its  viscosity.  See,  for 
example,  Liepman  et.  al.  (1982)  and  Liepmann  and  Nosenchuck  (1982)  who 
advocated  the  use  of  thermal  actuation  to  suppress  or  enhance  Toilmien- 
Schlichting  waves  in  laminar  boundary  layer  flow.  Active  control  strategies  can  be 
further  sub-classified  into  predetermined  and  feedback. 

Predetermined  control  consists  of  pre-programmed  actuation  designed  to 
accomplish  a  desired  outcome.  The  actuation,  however,  is  not  sensitive  to  the 
fluid's  behavior.  For  example,  one  may  periodically  modulate  the  motion  of  a 
cavity's  boundary  to  induce  chaotic  advection  and  enhance  mixing  (Ottino,  1991). 
Another  instance  of  predetermined  control  is  the  utilization  of  magnetic  fields  or 
time-periodic  modulation  of  wall  temperature  to  suppress  convective  processes. 

In  contrast,  a  feedback  controller  reacts  to  events  occurring  in  the  fluid  so  as 
to  suppress  or  enhance  naturally  occurring  instabilities.  To  make  the  concept  of 
feedback  control  more  concrete,  let's  consider  the  Rayleigh-Benard  problem  of  a 
horizontal  fluid  layer  heated  from  below  and  cooled  from  above  (Fig.  1).  In  the 
classical  problem,  as  the  temperature  difference  between  bottom  and  top 
(expressed  in  non-dimensional  form  as  the  Rayleigh  number)  increases,  the  state 
of  the  system  changes  from  no-motion  to  time-independent  motion  to  time- 
dependent,  possibly  chaotic,  motion.  These  transitions  occur  because  once  the 
Rayleigh  number  exceeds  certain  critical  values,  conditions  are  ripe  for  instabilities 
to  manifest  themselves.  The  objective  of  the  feedback  controller  is  to  enhance  the 
dissipating  mechanisms  so  as  to  suppress  the  naturally  occurring  instabilities.  For 
example,  Fig.  1  describes  the  control  strategy  (dealt  with  more  fully  later  in  the 
paper)  we  used  to  delay  the  transition  from  the  no-motion  to  the  time-independent 
motion  state.  In  this  example,  a  sensor  measures  the  fluid's  temperature  at  mid¬ 
height  and  the  bottom  temperature  is  slightly  modified  in  proportion  to  the  deviation 
of  the  sensed  temperature  from  its  desired  (no-motion)  value.  At  locations  where 
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the  flow  is  hotter  than  usual  and  the  fluid  tends  to  ascend,  the  boundary 
temperature  is  reduced  to  assist  in  dissipating  the  excess  heat.  The  reverse  occurs 
at  locations  where  the  fluid  descends.  This  strategy  can  be  used  to  significantly 
increase  the  magnitude  of  the  critical  Rayleigh  number  for  the  onset  of  convection. 


tfx,y,0/)  - 1 "  <?(x,;yA5,r) - 0  5) 

Fig.  1  A  schematic  description  of  the  use  of  feedback  control  to  stabilize  the  no¬ 
motion  state  in  the  Rayleigh-Benard  problem.  0  is  the  non-dimensional 
temperature;  x  and  y  are  coordinates  in  the  horizontal  platform;  z  is  a  vertical 
coordinate;  t  is  time;  and  K  is  the  controller's  gain. 


For  the  remainder  of  this  paper,  I  will  focus  on  the  use  of  active  feedback 
control  strategies  to  modify  the  nature  of  convective  flows. 

2.  GENERAL  CONCEPTS 

In  this  section,  I  will  present  the  general  concepts  underlying  feedback 
control  utilizing  mathematical  notation.  First,  let’s  consider  the  dynamic  system 
(i.e.,  the  Navier-Stokes  equations), 

“  =  F  (•£,;/?)  ^  j 

where  E  is  a  vector  of  dependent  variables  (i.e.,  velocity  components,  temperature, 
and  density);  superscript  dot  denotes  the  time  (t)  derivative;  R  is  the  parameters’ 
vector  (i.e.,  the  Reynolds  and/or  Rayleigh  number);  and  F  is  a  non-linear  (possibly 

differential)  operator  with  the  appropriate  boundary  conditions.  Let  E=Eo  denote 
an  equilibrium,  non-transient  (i.e.,  time-independent,  time-periodic  or  quasi- 
periodic)  solution  of  (1).  For  example,  Eo  may  be  one  of  the  non-stable  periodic 
orbits  embedded  in  the  chaotic  attractor.  As  R  is  varied,  equilibrium  states  may 
lose  stability  at  bifurcation  points  and  give  rise  to  new,  non-transient,  stable 
solutions.  Often  several  successive  bifurcations  will  eventually  result  in  chaotic 
behavior.  Eo  may  be  either  a  stable  solution  for  a  limited  range  of  the  parameters' 
R  (say,  for  R-j<R<R2)  or  a  non-stable  solution  for  all  R  values.  Let  us  assume  that 
for  a  particular  process,  a  selected  So  has  better  properties  than  the  normally 
occurring  flow  structure.  For  example,  in  Rayleigh-Benard  convection,  So  may 
represent  the  no-motion  state  which  we  wish  to  maintain  under  conditions  in  which 
convection  would  normally  occur.  In  turbulent  boundary  layer  flow,  So  may 


represent  either  the  laminar,  time-independent  state  or  one  of  the  time-periodic 
orbits  embedded  in  the  turbulent  attractor.  Our  objective  is  to  render  Eo  stable  in 
regions  of  the  parameter  space  (R)  in  which  it  would  normally  be  non-stable. 

2.1  DIRECT  APPLICATION  OF  FEEDBACK  CONTROL 

Feedback  control  is  implemented  by  augmenting  the  system  (1)  with  the 
controller  C(  ): 

E  =  G(E;  R,K)  =  F(E;R)  +  C(E-E0;R,K)t  (2) 

where  C(),  the  controller's  output,  is  either  a  linear  or  a  non-linear  operator  with 
C(0;  R,  K)=0  for  all  R  and  K  values;  K  is  the  controller's  gain  vector;  (E-Eo)  is  the 
deviation  of  the  measured  variables  from  their  desired  values;  and  Eq  is  an 
equilibrium  solution  which  may  be  stable  for  some  range  of  the  parameters  R  or 
non-stable  for  all  R  values.  In  the  former  case,  the  controller  may  be  used  to  move 
bifurcation  points  in  parameter  space  (i.e.,  delay  the  bifurcation  to  higher  values  of 
R),  thus  delaying  the  appearance  of  chaos.  In  the  latter  case,  the  controller  can  be 
used  to  stabilize  states  which  otherwise  would  be  non-stable  for  all  values  of  the 
parameter  R  such  as  non-stable  periodic  orbits  embedded  in  a  chaotic  attractor. 

Note  that  E=Eo  is  a  solution  of  both  the  original  system  (1)  and  the 
augmented  system  (2).  That  is,  the  controller  preserves  the  system's  equilibria.  In 
other  words,  the  controller  attempts  to  preserve  the  stability  of  Eo  in  regions  of 
parameter  space  in  which  Eo  would  normally  be  non-stable.  The  proposed 
method  of  control  is  not  the  only  possible  one.  Alternatively,  one  might  attempt  to 
achieve  the  desired  flow  behavior  by  generating  a  new  flow  structure  (En)  which  is 
an  equilibrium  state  of  (2)  but  not  of  (1),  i.e.,  C(E-En;  R,K)*0.  Such  a  control 
strategy  might  conceivably  be  necessary  if  none  of  the  equilibrium  states  (Eo)  of  (1 ) 
has  the  desired  behavior.  However,  an  attempt  to  induce  a  flow  structure  (En) 
which  is  not  an  equilibrium  solution  of  (1)  may  require  a  fair  amount  of  power 
investment.  In  contrast,  the  stabilization  of  Eo  in  a  noise-free  environment  requires 
an  infinitesimal  amount  of  power.  Here,  we  discuss  only  the  stabilization  of 
equilibrium  solutions. 

In  practice,  the  controller  C(  )  can  be  implemented  by  manipulating  the 
system's  boundary  conditions.  For  example,  one  can  equip  some  of  the  system's 
surfaces  with  sensors  and  actuators. 

In  relatively  simple  cases,  the  analysis  of  the  controlled  system  (2)  can  be 
carried  out  using  tools  of  stability  theory  to  establish  the  effects  of  controller  gains 
on  the  bifurcation  structure  of  the  system.  In  more  difficult  cases,  one  might  utilize  a 
"black  box"  approach  and  implement  the  control  without  knowing  either  the 
bifurcation  structure  of  the  system  (1)  or  the  mathematical  model  F(-).  One  does 
need,  however,  to  have  an  estimate  for  Eg. 
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2.2  TRAJECTORY  DIRECTING 

A  somewhat  different,  but  related,  control  strategy  based  on  analyzing  the 
behavior  of  a  system's  trajectories  in  phase  space  has  been  advanced  by  Ott  et.  al. 
(1990a  and  b).  The  major  advantage  of  this  technique  is  that  it  provides  a 
procedure  to  implement  control  when  the  mathematical  model  for  the  system  is  not 
known  or  is  too  complex  to  analyze.  The  method  has  been  used  thus  far  only  in 
low-dimensional  systems.  The  procedure  requires  the  identification  of  the  non¬ 
stable  periodic  orbit  (P),  which  one  wishes  to  stabilize  and  which  is  embedded  in 
the  chaotic  attractor,  and  its  linearly  stable  (E^p)  and  unstable  (Eup)  manifolds. 
This  can  be  done  by  using  embedding  to  reconstruct  the  phase  space  of  the 
attractor  from  a  time  series  of  one  of  the  system’s  variables  obtained  by  numerical 
or  physical  experiments.  Subsequently,  one  can  construct  a  return  map  to  a 
selected  (Poincare)  cross-section  obtained  by  stroboscopically  sampling  the  data 
at  the  period  of  the  orbit  one  wishes  to  stabilize.  This  periodic  orbit  is  a  fixed  point 
on  the  map.  The  controller  is  applied  in  such  a  way  as  to  force  the  system's 
trajectories  to  approach  the  stable  manifold  (E$p)  of  the  periodic  orbit  (P).  Ott  et. 
al.  (1990)  have  successfully  stabilized  some  periodic  orbits  of  the  Henon  attractor 
by  applying  the  controller  once  within  each  period.  The  same  technique  has  been 
implemented  experimentally  by  Ditto  et.  al.  (1990)  to  stabilize  two  otherwise  non¬ 
stable,  periodic  orbits  embedded  in  the  chaotic  vibrations  of  a  magnetostrictive 
ribbon. 

2.3  OPTIMAL  CONTROL 

Optimal  control  involves  the  minimization  of  an  appropriately  defined  cost 
function  (Lions,  1971).  For  example,  when  one  is  applying  heating  and  cooling  for 
drag  reduction  in  boundary  layer  flow,  the  cost  function  may  be  defined  as: 

T 

0(q)  =  ~  I dt(jjw  q(x,y,t)2dxdy  +  D)  /3x 

where  q(x,y)  is  the  distribution  of  heating/cooling  fluxes  along  the  surface;  the 
integration  is  carried  out  over  the  entire  surface  over  time  interval  T;  D  is  the  drag; 
and  w  is  a  weight  which  represents  the  control's  cost.  In  the  case  of  convection 
suppression  in  the  Rayleigh-Benard  problem,  D  may  represent  the  absolute  value 
of  the  vertical  velocity  component  integrated  along  a  horizontal  cross-section  or  the 
Nusselt  number.  The  objective  is  to  obtain  the  distribution  q  which  minimizes  the 
cost  function  O(q)  subject  to  various  constraints.  Thus,  the  optimal  control  problem 
becomes  a  problem  of  the  calculus  of  variations. 

Depending  on  the  system's  constraints,  the  resulting  flow  structure  of  the 

controlled  system  may  or  may  not  be  an  equilibrium  solution  (5o)  of  the  system  (1). 
The  problem  can  be  formulated,  however,  as  a  feedback  control  problem  in  which 
q=q(5-Eo)  with  q(0)=0. 

The  advantage  of  the  "optimal  control"  technique  is  that  it  provides  a 
systematic  algorithm  to  obtain  a  desired  state  of  the  system  and  to  optimize  the 
control  strategy.  Unlike  the  method  described  in  section  2.1,  one  need  not  select  a 
pre-specified  functional  form  for  the  operator  C().  The  disadvantage  is  that  the 


method  is  computationally  intensive  and  the  minimization  of  the  cost  function 
requires  that  the  system  (1)  be  solved  repetitively.  This  may  be  beyond  present 
computational  capabilities  in  situations  in  which  the  system  (1)  consists  of  the 
Navier-Stokes  equations.  To  alleviate  this  problem,  Choi  et.  ai.  (1992)  have 
designed  a  simplified  procedure  which  they  have  entitled  "suboptima)  control"  and 
have  applied  it  to  the  stochastic  Burgers  equation. 

3.  EXAMPLES  OF  THE  IMPLEMENTATION  OF  FEEDBACK  CONTROL 
STRATEGIES 

In  this  section,  I  describe  some  of  the  work  that  has  been  done  at  the 
University  of  Pennsylvania  on  feedback  control  of  convective  flows  in  a  thermal 
convection  loop  (Singer  et.  al.  1991,  Singer  and  Bau,  1991,  Wang  et.  al.,  1991) 
and  in  a  saturated  porous  layer  heated  from  below  and  cooled  from  above  (Tang 
and  Bau,  1992a  and  b).  To  date,  we  have  been  able  to  demonstrate  both 
theoretically  and  experimentally  in  the  case  of  the  thermal  convection  loop  that  a 
feedback  controller,  such  as  the  one  described  in  section  2.1 ,  can  successfully  and 
profoundly  alter  the  bifurcation  structure  of  convective  motions  by  effecting  small 
perturbations  in  the  boundary  conditions.  Currently,  we  are  verifying  that  similar 
results  can  be  achieved  in  a  fluid  layer  heated  from  below  and  cooled  from  above. 


Fig.  2:  Schematic  description  of  the 
experimental  apparatus. 
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Fig.  3:  An  unsealed  bifurcation 
diagram  depicting  various  possible 
solutions  as  Ra  is  increased  (P=4). 
Stable/non-stable  solutions  are 
denoted  by  solid/dashed  lines.  The 
dark  region  represents  the 
appearance  of  the  strange 
attractor  which  exists  for  Ra>Ra a. 
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3.1  THE  THERMAL  CONVECTION  LOOP 

Consider  a  thermal  convection  loop  constructed  from  a  pipe  bent  into  a  torus 
and  standing  in  the  vertical  plane  as  depicted  in  Fig.  2.  The  diameter  of  the  pipe  is 
d;  and  the  diameter  of  the  torus  is  D.  q  is  the  angular  location  of  a  point  on  the 
torus.  The  time-independent  wall  temperature  of  the  pipe  Tw(q),  which  may  vary 
with  the  angular  location  q,  is  symmetric  with  respect  to  the  torus  axis  that  is 
parallel  to  the  gravity  vector.  Variations  in  the  wall  temperature  may  cause  a 
spatial  temperature  distribution  inside  the  fluid  which,  under  appropriate 
conditions,  may  induce  fluid  motion  in  the  loop.  The  convection  in  such  loops  has 
been  investigated  by,  among  others,  Creveling  et.  al.  (1975)  and  Gorman  et.  al. 
(1984,  1986). 

The  motion  in  the  loop  can  be  described  within  the  framework  of 
Boussinesq’s  approximation  by  using  a  one-dimensional  model  consisting  of 
mass,  momentum  and  energy  balances  (Bau  and  Wang,  1991): 


(4) 

(5) 


u  =  u(t), 

«  =  —  Ra  PjTcos(0)dd- Pu,  and 

<6> 

The  fluid  is  assumed  to  be  incompressible  and  Newtonian.  In  the  above,  all 
quantities  are  non-dimensional.  The  length  scale  is  the  torus'  radius  D/2; 
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Ra  =  -pp  is  the  loop's  Rayleigh  number;  (3  is  the  thermal  expansion  coefficient; 

g  is  the  gravitational  acceleration;  and  AT  is  the  averaged  wall  temperature 
difference  between  the  loop's  bottom  and  top.  The  time  scale  is  x=  p0Cpd/(4h), 
where  p0  is  the  fluid's  average  density,  Cp  is  the  thermal  capacity,  and  h  (which  we 
assume  to  be  constant)  is  the  heat  transfer  coefficient  between  the  fluid  and  the 
pipe's  wall.  P*=32v  x/d2=8  Pr/Nu  is  the  loop’s  Prandtl  number,  where  v  is  the 
kinematic  viscosity.  Pr=v/a  and  Nu=hd/k  are  the  conventional  Prandtl  and  Nusselt 
numbers,  respectively,  a  and  k  are  the  fluid's  thermal  diffusivity  and  conductivity. 
B=(d/D)2/Nu  is  the  Biot  number. 

The  system  (4-6)  exhibits  multiple,  non-transient  states  (uo(t),  To(9,t)}  which 
may  exist  only  for  certain  Ra  values  and  whose  stability  characteristics  may  vary  in 
accordance  with  the  magnitude  of  the  Rayleigh  number.  Our  objective  is  to 
stabilize  and  destabilize  some  of  these  equilibrium  states  through  the  use  of 
feedback  control.  Our  control  strategy  is  to  measure  the  deviation  of  the  actual 
temperature  from  its  desired  value  at  various  locations  around  the  loop  and  modify 
the  wall  temperature  accordingly.  For  example,  a  convenient  input  signal  to  the 
controller  is  ATg-g^J-ATg.g.oO),  where  AT3-g(t)  and  AT3-gto(t)  denote, 
respectively,  the  actual  and  the  desired  temperature  differences  between  positions 
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3  and  9  o'clock  around  the  loop.  The  wall  temperature  is  modified  in  proportion  to 
the  above  deviation,  i.e., 

T<9’')=Tnomina|(e>+C!4T3-9«-AT3-9,0(,)l'  <7> 

where  the  controller  C(-)  is  typically  chosen  to  be  a  linear  function  of  its  argument. 
To  facilitate  the  analysis,  the  temperature  distribution  in  the  fluid  and  the  wall 

is  expanded  into  a  Fourier  series  in  terms  of  6  (Malkus,  1977). 


Tw(e,t)  =  W'0(f)  +  £WB(t)  sin(n0),  and 

n=l 

mt)  =  XSn(t)  sin(n6)  +  Cn(t)  cos(n9). 

n=0 


(8) 

(9) 


Expansions  (8-9)  are  substituted  into  equations  (4-6).  By  requiring  that  the 
equations  be  satisfied  in  the  sense  of  weighted  residuals,  the  infinite  dimensional 
system  (4-6)  is  transformed  into  an  infinite  set  of  ordinary  differential  equations. 
Three  of  these  equations,  similar  in  nature  to  the  Lorenz  (1967)  equations, 
decouple  from  the  rest  of  the  set  and  can  be  solved  independently  with  exact 
closure.  In  other  words,  the  entire  dynamics  of  the  system  can  be  captured  by  three 
ordinary  differential  equations: 


« 

u 

—  -C-U, 

P  (10) 

C--US-C  (11) 

s  =  uc  -  s  +  RaWx  (12) 

In  the  above,  the  dependence  on  the  Biot  number,  B,  was  removed  via  the 
simple,  algebraic  transformation  (u,  c,  s,  Ra,  P,  t}  <-  1/(1 +B)  {u,  Ra  Ci,  Ra  Si, 
Ra/{  1+B),  P,  t(1+B)2}.  The  quantities  c  and  s,  respectively,  present  the 
temperature  differences  in  the  fluid  between  positions  3  and  9  and  6  and  1 2  o'clock 
around  the  loop.  Wi  represents  the  difference  in  the  wall  temperature  between 
positions  6  and  12  o'clock.  For  the  uncontrolled  case,  without  loss  of  generality,  we 
set  Wi  =-1. 

The  three  ODEs  (10-12)  possess  a  number  of  equilibrium,  non-transient 
solutions,  some  of  which  are  listed  below: 

(A)  no  motion  state  (u=0); 

(B)  time-independent  motion,  either  in  the  clockwise  (denoted  B-)  or 

counterclockwise  (denoted  B+)  direction  (u=±^Ra-l); 

(C)  chaotic  motion;  and 

(D)  periodic  motions  of  various  periodicities  (i.e.,  for  certain  values  of  the 
Rayleigh  number,  one  observes  stable  periodic  solutions.  Also,  non-stable 
periodic  orbits  are  embedded  in  the  chaotic  attractor). 


Some  of  the  above  solutions  and  their  stability  characteristics  are  depicted 
schematically  in  Fig.  3  for  a  loop  Prandtt  number  P=4  which  we  estimate  to 
approximate  the  loop  Prandtl  number  of  our  experimental  apparatus  (Singer, 
1991).  In  Fig.  3,  we  denote  stable  and  non-stable  solutions  by  solid  and  dashed 
lines,  respectively. 

Briefly,  if  one  were  to  follow  the  chain  of  events  in  the  uncontrolled  system  as 
the  Rayleigh  number  (Ra)  increases,  one  would  observe  no  net  motion  in  the  loop 
for  At  Ra= 1,  the  no-motion  solution  loses  its  stability  through  a  supercritical 
pitchfork  bifurcation  and  is  replaced  by  time-independent  motion.  Depending  on 
random  disturbances,  this  motion  will  be  either  in  the  clockwise  (B-)  or 
counterclockwise  (B+)  direction.  The  motion  solution  is  stable  for  1<Ra<16.  At 
Ra=16,  the  steady  solution  loses  stability  through  a  subcritical  Hopf  bifurcation. 
The  resulting  limit  cycle  is  non-stable  and  its  period  increases  to  infinity  as  the 
Rayleigh  number  decreases  to  /tohom~7-378,  where  the  periodic  orbit  becomes 
an  homoclinic  orbit  and  passes  through  the  no-motion  state  (A).  At  Ra-Ra horn, 
there  is  a  bifurcation  (the  homoclinic  explosion)  which  results  in  an  assortment  of 
non-stable  periodic  and  non-periodic  orbits  known  collectively  as  the  non¬ 
wandering  set  which  is  initially  non-attracting.  As  the  Rayleigh  number  is  further 
increased  beyond  /toA~15.984^/?«3H,  the  non-wandering  set  becomes  a  strange 
(the  Lorenz)  attractor.  The  chaotic  regime  exists  for  Ra>RaA  with  occasional 
windows  of  periodic  behavior.  Non-stable  periodic  orbits  of  various  periodicities 
are  embedded  in  the  chaotic  attractor.  In  the  chaotic  regime,  the  motion  in  the  loop 
consists  of  irregular  oscillations  with  occasional  reversals  in  the  direction  of  the 
flow  as  shown,  for  example,  in  Fig.  3  for  /ta=3/toH(4)=48.  In  Fig.  4,  the 
experimentally  observed  temperature  difference  between  positions  3  and  9  o'clock 
are  depicted  as  functions  of  time.  Qualitatively  similar  behavior  is  exhibited  by  the 
variable  c  when  the  equations  (10-12)  are  integrated  numerically.  The  positive  and 
negative  values  of  c  in  Fig.  4  correspond  to  flow  in  the  counterclockwise  and 
clockwise  directions,  respectively. 

For  example,  the  control  (7)  for  suppressing  chaotic  motion  is  incorporated 
into  the  mathematical  model  by  setting, 


Wx=-l-—(c(t)-c0% 

KCL 


(13) 


where  for  demonstration  purposes,  proportional  control  is  employed.  In  the  above, 
c(t)  and  co  denote,  respectively,  the  actual  and  the  desired  temperature  differences 
between  positions  3  and  9  o'clock. 

With  the  use  of  similar  control  strategies,  we  can  alter  the  bifurcation 
structure  depicted  in  Fig.  3.  More  specifically,  we  can: 

(i)  maintain  the  no-motion  state  at  significantly  higher  temperature  differences 
between  the  heated  and  the  cooled  portions  of  the  loop  than  the  critical  one 
corresponding  to  the  onset  of  convection  in  the  uncontrolled  system  (verified 
thus  far  by  theoretical  calculations  only,  Singer  and  Bau,  1991); 

(ii)  maintain  steady,  time-independent  flow  under  conditions  in  which  the  flow 
would  otherwise  be  chaotic  (verified  by  experiments  and  theory,  Singer  et.  at. , 
1991 ,  and  Wang  et.  al.,  1992). 
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(iii)  stabilize  periodic  non-stable  orbits  which  exist  in  the  chaotic  regime  of  the 
uncontrolled  system  (verified  thus  far  by  theoretical  calculations  only,  Singer 
and  Bau,  1991);  and 

(iv)  induce  chaos  in  otherwise  laminar  (fully  predictable),  non-chaotic  flow 
(verified  by  theory  and  experiments,  Wang  et.  at.,  1992). 
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Fig.  4:  The  experimentally  measured 
temperature  differences  AT3-9  depicted 
as  functions  of  time. 


Fig.  6:  The  temperature  difference 
between  positions  3  and  9  o'clock  is 
depicted  for  /?a= 50  and  P=4  as  a 
function  of  time  for  the  controlled  (K=- 
35,  solid  line)  and  the  uncontrolled 
(K=0,  dashed  line)  systems. 


Fig.  5:  Stability  characteristics  of  the 
counterclockwise  motion  solution  under 
proportional  control.  The  critical 
Rayleigh  number  is  depicted  as  a 
function  of  the  controller's  gain  (K). 


Fig.  7:  The  experimentally  measured 
temperature  difference  AT3-9  is 
depicted  as  a  function  of  time  for  similar 
conditions  to  those  in  Fig.  6.  The 
controller  was  activated  33  minutes  into 
the  run. 


For  example,  we  stabilized  the  time-independent  motion  (B+),  by  employing 

the  control  strategy  described  by  equation  (13)  with  c0(r)  =  V/?a- 1.  Fig.  5  depicts 
our  theoretical  results  pertaining  to  the  effect  of  the  controller’s  gain  on  the  location 
of  the  Hopfs  bifurcation  point  RaH-  Clearly,  negative  (positive)  values  of  K  serve  to 
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stabilize  (destabilize)  the  counterclockwise  motion  solution.  As  the  magnitude  of 
the  controller’s  gain  decreases,  the  magnitude  of  the  Rayleigh  number  at  the  Hopf 
bifurcation  point  increases  and  the  onset  of  chaos  is  delayed. 

Another  interesting  phenomenon  exhibited  in  Fig.  5  is  that  for  sufficiently 
high  values  of  the  controller’s  gain  (i.e.,  K>1.63),  the  Hopf  bifurcation  becomes 
supercritical.  For  example,  in  an  experiment  (Fig.  5)  in  which  we  set  K=i.7  and 
gradually  increased  the  Rayleigh  number,  we  observed  bifurcation  from  time- 
independent  to  stable,  time-periodic  flow. 

Figures  6  and  7  illustrate  the  impact  of  the  feedback  control  on  the  system. 
In  Fig.  6,  the  theoretically  obtained  temperature  difference  between  positions  3  and 
9  o'clock  is  depicted  as  a  function  of  time  for  the  controlled  (solid  line)  and 
uncontrolled  (dashed  line)  systems  for  the  same  nominal  operating  conditions 
(Ra=50~3RaH).  The  uncontrolled  system  exhibits  chaotic  oscillations  while  the 
controlled  system  exhibits  laminar  behavior.  Similar  behavior  is  observed  in 
experiments.  Fig.  7  depicts  the  experimentally  obtained  temperature  difference 
between  positions  3  and  9  o'clock  as  a  function  of  time  for  the  same  nominal 
operating  conditions  as  in  Fig.  6.  To  illuminate  the  impact  of  the  feedback  control, 
the  behavior  of  the  uncontrolled  system  is  depicted  for  about  33  minutes  prior  to  the 
application  of  the  controller.  Witness  that  once  the  controller  has  been  engaged, 
the  seemingly  random,  violent  oscillations  disappear  and  the  flow  is  ’’laminarized". 
The  relatively  low  amplitude  oscillations  seen  in  the  system  after  the  application  of 
the  controller  are  attributable  to  the  fact  that  our  experimental  system  is  relatively 
noisy  and  sensitive  to  changes  in  the  environment. 


Fig.  8:  The  controller  is  being  used  to  Fig.  9:  Stabilization  of  an  otherwise 
induce  chaos  at  Ra=0.6RaH  with  non-stable  periodic  orbit  embedded  in 
K=35W/C.  The  figure  depicts  the  the  chaotic  attractor  for  Ra=50. 
temperature  difference  between 
positions  3  and  9  o’clock  as  a  function 
of  time. 


We  can  also  induce  chaos  under  circumstances  in  which  chaos  will  not 
normally  occur  by  using  positive  feedback  (K>0)  in  (13).  In  this  case,  though,  we 
will  need  to  destabilize  simultaneously  both  the  counterclockwise  (B+)  and  the 
clockwise  (B-)  time-independent  motion  states.  This  we  accomplish  by  employing 
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the  control  strategy  W,  =-l - (Sgn(c)c(r)-c0)  .  The  experimental  results  of  this 

Ret 

strategy  are  depicted  in  Fig.  8,  where  we  show  c(t)  as  a  function  of  time  for 
subcritical  Rayleigh  numbers  Ra~0.6  RaH=10.  To  illustrate  the  difference  between 
the  uncontrolled  and  the  controlled  systems,  we  initially  depict  the  uncontrolled 
behavior  for  t<180n?  Witness  the  relatively  time-independent,  counterclockwise 
motion.  The  controller  was  activated  at  t=1800s.  It  caused  flow  instabilities  to 
amplify,  as  is  evident  in  Fig.  8,  until  eventually  chaotic  flow  developed. 

Finally,  we  can  use  the  controller  to  stabilize  one  of  the  otherwise  non¬ 
stable,  periodic  orbits  embedded  in  the  chaotic  attractor.  We  do  this  by  employing 
a  control  strategy  similar  to  the  one  described  by  equation  (13),  with  co(t)  now 
representing  the  periodic  orbit  we  desire  to  stabilize.  The  periodic  orbit  was 
identified  by  examining  a  return  map.  Alternatively,  one  can  employ  one  of  the 
techniques  described  in  Mindlin  and  Gilmore  (1992).  The  controller  can  be 
employed  either  continuously  in  time  or  at  discrete  time  intervals  such  as  once 
every  period  (i.e.,  K=Ko5(t-nST),  where  8(-)  is  the  Dirac  delta  function,  HT  is  the 
period  of  the  orbit  we  wish  to  stabilize,  and  n  is  an  integer)  or  when  the  orbit  passes 
in  close  vicinity  to  a  point  on  the  periodic  orbit  we  wish  to  stabilize.  Fig.  9  depicts  a 
representative  result  of  the  "cs”-plane  projection  of  a  stabilized  periodic  orbit  under 
conditions  in  which,  in  the  absence  of  the  controller,  the  flow  would  be  chaotic 
(Ra=50). 

As  we  have  seen,  the  thermal  convection  loop  paradigm  exhibits 
complicated  temporal  behavior.  Next,  I  describe  a  strategy  for  controlling 
convection  in  a  system  which  is  not  only  temporally  but  also  spatially  complex. 

3.2  THE  LAPWOOD  PROBLEM 

Let  us  consider  thermal  convection  in  a  saturated  porous  layer  confined  in 
an  upright  box,  heated  from  below  and  cooled  from  above.  The  side  walls  of  the 
box  are  insulated.  As  in  the  case  of  the  thermal  convection  loop  paradigm,  as  the 
temperature  difference  between  the  bottom  and  top  (or  the  Rayleigh  number) 
increases,  the  equilibrium  state  of  the  system  undergoes  a  sequence  of 
bifurcations  from  no-motion  to  time-independent,  cellular  motion  to  oscillatory, 
chaotic  flow.  Our  objective  is  to  shift  the  bifurcation  points  with  the  aid  of  a 
feedback  controller.  I  describe  below  only  the  stabilization  of  the  no-motion  state 
(Tang  and  Bau,  1992  a  and  b). 

I  use,  respectively,  the  height  of  the  box  (H),  the  thermal  diffusion  time  H/a 
(where  a  is  the  equivalent  thermal  diffusivity  of  the  porous  medium),  and  the 
nominal  temperature  difference  (AT*)  between  the  bottom  and  top  of  the  box  as  the 
length,  time  and  temperature  scales.  In  the  uncontrolled  system,  the  non- 
dimensional  bottom  and  top  temperatures  are  assigned  the  values  one  and  zero, 
respectively.  The  non-dimensional,  horizontal  dimensions  of  the  box  are  Lx  and 
Ly.  The  motion  of  the  fluid  is  described  by  the  Darcy-Oberbeck-Boussinesq  (DOB) 
equations  (i.e.,  Joseph,  1976): 
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u  =  B'\-u-Vp  +  R6e:) 

b  =  x~l(“!-uV6+v2ey  (14) 

where  uT={ux,uy,U2}  is  the  divergent  free  (V-m  =  0),  non-dimensional  velocity  vector 
which  satisfies  the  impermeability  boundary  conditions  on  all  solid  surfaces;  x  and 
y  are  horizontal,  Cartesian  coordinates;  z  is  the  vertical  coordinate;  0=T-(1-z)  is  the 
deviation  of  the  non-dimensional  temperature  from  the  linear  (conduction)  profile; 


R  =  is  the  Darcy-Rayleigh  number-  g  is  the  gravitational  acceleration;  k  is 

the  medium's  permeability;  P  and  v  are,  respectively,  the  thermal  expansion 
coefficient  and  the  kinematic  viscosity  of  the  saturating  fluid;  p  is  the  pressure 


deviation  from  its  hydrostatic  value;  B  = 


A  a 
<t>H 2  v 


(usually,  B«1);  <j>  is  the  porosity; 


and  x  =  ^°Cp^eq  is  the  ratio  between  the  equivalent  thermal  capacity  of  the 
(Poc/>)/ 

medium  and  that  of  the  saturating  fluid. 

The  DOB  equations  admit  the  no-motion  solution  {u,  8}=0,  which  for  R<Rc  is 
globally  stable.  The  magnitude  of  Rc  depends  on  the  aspect  ratios  Lx  and  Ly.  For 
example,  for  an  unconfined  fluid  layer  with  rigid  bottom  and  top  boundaries, 
Rc=4ji2  (i.e.,  Lapwood,  1948).  At  R=Rc,  the  no-motion  state  loses  stability  through 
a  supercritical  bifurcation  into  time-independent,  cellular  motion.  The  linearized 
problem  of  stability  is  self-adjoint,  thus  the  growth  rate  at  the  onset  of  instability  is 
real.  In  other  words,  the  bifurcation  occurs  through  a  real  eigenvalue,  and  the 
"principle  of  exchange  of  stability"  is  valid. 

To  extend  the  range  of  stability  of  the  no-motion  solution  to  higher  values  of 
the  Rayleigh  number,  we  modify  the  bottom  temperature  in  proportion  to  the 
deviation  of  some  interior  temperatures  from  their  (no-motion)  conduction  value. 
We  will  assume  that  the  temperature  distribution  is  measured  at  some  horizontal 
cross-section,  i.e.,  at  z=0.5,  as  a  function  of  the  horizontal  co-ordinates  (x,  y).  The 

controller  will  sense  the  deviation  8o.5=0(x,y,O.5,t)  of  the  actual  temperature  from 
its  desired  value  (0=0)  in  the  plane  z=0.5.  (For  R>Rc,  in  the  absence  of  the 

controller,  0O  5  *  0).  It  will,  then,  modify  sliqhtly  the  spatial  distribution  of  the  base's 
temperature  in  proportion  to  the  measured  deviation  and  its  time  derivative,  i.e., 


0(0, x,y)  =  C(0O  5)  =  -Kp60  j 


is  d&o.s 

Kd~1T' 


(15) 


where  Kp  and  Kd  represent,  respectively,  the  gains  of  the  proportional  and  the 
differential  controllers.  Note  that  boundary  condition  (15)  replaces  the  boundary 


condition  0(x,y,O,t)=O  in  the  uncontrolled  problem.  This  can  be  accomplished  in 
practice,  for  example,  by  embedding  a  network  of  computer-controlled  heaters  in 
the  base  plate  and  varying  their  heat  input  in  an  appropriate  manner. 

Next,  we  investigate  the  linear  stability  of  the  controlled  system  in  the 
presence  of  the  controller.  The  analysis  is  somewhat  more  complicated  than  in  the 
uncontrolled  case  since  the  system  (14-15)  is  no  longer  self-adinint  and  one 
cannot  ignore  the  possibility  of  the  no-motion  state  undergoing  a  Nopf  bifurcation 
resulting  in  time-periodic  motion.  As  in  the  classical  case,  the  disturbances  can  be 
decomposed  into  even  and  odd  modes  with  respect  to  the  mid  plane  (z=0.5).  The 
odd  modes,  which  lose  stability  at  R=16x2  (r=4),  have  a  node  at  z=0.5.  Thus,  the 
control  strategy  employed  here  (equation  15)  will  affect  only  the  stability 
characteristics  of  the  even  modes.  The  results  of  the  analysis  are  summarized  in 
Fig.  10,  where  we  depict  the  ratio  between  the  Rayleigh  numbers  of  the  controlled 

and  uncontrolled  systems  at  marginal  stability  (r=R/4rc2)  as  a  function  of  the 
proportional  controller's  gain  (Kp)  for  various  gains  of  the  differential  controller 
(Kd=0.,  0.02,  and  0.06)  for  a  box  with  aspect  ratios  Lx=Ly=1.  The  various  symbols 
in  Fig.  10  represent  the  "most  dangerous"  modes  (n,m).  The  first  number  (n) 
represents  the  number  of  cells  in  one  direction  and  the  second  number  (m) 
represents  the  number  of  cells  in  the  other  direction.  For  example,  the  modes  (1 ,0) 
and  (1,1)  correspond,  respectively,  to  unicellular  two  and  three-dimensional 
convection.  The  solid  and  dashed  lines  represent,  respectively,  bifurcations 
through  real  and  complex  eigenvalues.  The  figure  demonstrates  that  through  the 
action  of  the  controller,  the  critical  Rayleigh  number  at  the  onset  of  convection  can 
be  increased  by  as  much  as  a  factor  of  4.  Through  optimization,  a  further  increase 
in  the  critical  Rayleigh  number  may  be  attainable. 

We  have  also  successfully  applied  similar  techniques  to  stabilize  the  no¬ 
motion  state  in  the  Rayleigh-Benard  problem  (Tang  and  Bau,  1992c). 


Fig.  10:  The  critical  normalized  Rayleigh  number  (r)  at  the  onset  of  convection  is 
depicted  as  a  function  of  the  controller  gain  Kp  for  Kd=0.0,  0.02,  and  0.06.  The 
symbols  represent  the  various  modes  (n,m).  The  solid  and  dashed-dot  lines 
correspond,  respectively,  to  bifurcations  into  time-independent  and  time-periodic 
convection. 
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4.  CONCLUSION 

As  the  simple  examples  described  here  indicate,  the  bifurcation  structure  of 
convective  flows  can  be  modified  in  a  significant  way  through  the  use  of  feedback 
control.  In  the  future,  similar  techniques  may  profoundly  impact  diverse 
technological  processes,  enabling  us,  for  instance,  to  substantially  reduce  drag  in 
turbulent  flows  and  significantly  modify  convection  in  crystal  growth  and  material 
processing  industries. 
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Abstract 

I  describe  two  examples  of  the  application  of  mechanics  to  biological  rod-like  structures 
at  the  molecular  level. 

The  first  is  the  filament  of  a  bacterial  flagellum.  This  is  a  slender  but  rigid  corkscrew¬ 
like  structure  by  which  a  bacterium  propels  itself  through  the  water.  The  filament,  of  diameter 
20  nm,  is  built  from  identical  subunits  in  a  definite  cylindrical  lattice.  The  fact  that  the 
filament  is  curved  rather  than  straight  poses  a  paradox,  which  can  be  resolved  by  mechanics. 

The  second  example  is  DNA,  the  material  of  heredity.  This  molecule  is  a  rod  of 
diameter  2  nm,  with  a  double-helical  structure.  Mechanical  properties,  such  as  intrinsic 
curvature  and  flexibility,  are  a  function  of  the  sequence  of  bases  along  the  molecule;  and  they 
are  used  in  the  recognition  of  particular  sequences  of  DNA  by  protein  molecules.  Again, 
mechanics  enables  us  to  understand  the  behaviour. 

I  also  make  some  general  remarks  about  some  hazards  which  are  likely  to  emerge  when 
we  try  to  apply  mechanics  to  problems  in  molecular  biology. 


1 .  INTRODUCTION 

In  this  talk  I  intend  to  discuss  two  specific  examples  of  the  application  of  mechanics  -  as 
we  understand  that  term  here  in  IUTAM  -  to  the  solution  of  structural  problems  in  molecular 
biology.  The  first  example  is  a  slender,  rigid,  corkscrew-like  filament,  the  bacterial  flagellum, 
which  is  part  of  the  machinery  for  propelling  bacteria  through  water:  it  is  a  filament  of 
diameter  20  nm,  and  is  made  from  a  single  type  of  "building  block",  which  is  a  molecule  of 
the  protein  flagellin.  The  problem  is  to  understand  the  principles  of  construction  of  the 
filament.  The  second  example  is  the  double-helical  DNA  molecule,  a  thread  of  diameter  2  nm, 
which  carries  the  complete  genetic  information  in  every  cell  of  a  living  organism.  Here  the 
problem  is  to  understand  the  mechanical  properties  of  the  thread  -  such  as  its  intrinsic 
curvature  and  flexibility  -  in  terms  of  its  base  sequence,  and  how  these  play  a  role  in  the  ways 
by  which  DNA  works  within  the  cell. 

I  thought  it  would  be  more  interesting  to  present  here  two  examples  of  this  genre  rather 
than  one,  in  spite  of  the  obvious  difficulty  inherent  in  such  a  plan  over  the  time  available  for 
necessary  detail.  My  main  aim  will  be  to  try  to  convince  you  that  mechanics  has  some 
interesting  applications  in  the  field  of  molecular  biology  -  even  if  you  are  quite  unfamiliar  with 
biology  as  a  discipline. 

Let  me  begin  by  making  some  general  remarks  about  collaboration  between  mechanics 
people  and  biologists.  Both  groups  work,  of  course,  within  the  same  general  framework  of 
"scientific  method”;  but  in  practice  there  is  usually  a  high  barrier  between  the  two  sets  of 
people.  One  element  of  this  barrier  is  jargon:  each  side  coins  words  and  expressions  which 
will  be  convenient  for  the  communication  of  ideas  and  thoughts.  However,  as  most  of  us  are 
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aware,  jargon  within  any  field  can  actually  inhibit  clear,  productive  thought.  Thus,  someone 
from  outside  a  given  field  often  has  a  double  task  in  penetrating  the  jargon  to  the  core  ideas  of 
the  subject.  I  shall  try,  of  course,  to  keep  this  talk  relatively  free  from  jargon. 

In  trying  to  overcome  barriers  between  ourselves  and  biologists,  it  is  necessary  for  us  to 
talk  with  biologists;  and  even  more  important  for  us  to  listen  to  them.  For  a  tremendo.s 
wealth  of  experimental  observation  is  available  in  biology;  and  a  mechanician  must  hope  that 
somewhere  among  the  data  a  suitable  problem  may  be  found.  We  are  definitely  not  in  the 
business  of  having  a  solution  and  looking  for  a  suitable  problem  to  which  it  may  be  applied. 
Rather,  we  are  looking  for  tractable  problems  among  a  veritable  haystack  of  empirical  data. 
This  is  an  area  of  work  where,  if  one  can  but  state  the  problem,  one  has  already  come  a  long 
way  towards  its  solution. 

A  special  difficulty  which  mechanics  people  will  experience  in  this  field  is  that  biologists 
talk  and  think  a  great  deal  about  "structure"  at  many  different  levels.  Most  biologists  have 
"hands-on"  experience  with  three-dimensional  chemical  model  structures  of  the  ball-and-spoke 
variety;  and  many  are  also  competent  crystallographers.  When  they  discuss  possible 
alternative  structural  conformations  for  a  molecule  -  whether  large  or  small  -  they  find  it 
natural  to  think  in  terms  of  "minimum  energy"  configurations.  Very  few  biologists  have  ever 
encountered  what  we  call  the  concept  of  stress',  which  is,  of  course,  absolutely  central  in  all  of 
our  applied-mechanics  approach  to  engineering  structures.  This  difference  in  oudook  can  be  a 
big  stumbling-block  to  cooperation. 

As  I  have  intimated,  this  lecture  will  be  rather  scanty  on  details.  Those  who  seek  a  more 
complete  description  of  the  work  will  get  help  from  the  bibliography. 


2 .  CONSTRUCTION  OF  THE  BACTERIAL  FLAGELLUM  [1-4] 

I  was  fortunate  to  have  been  given  this  problem  by  my  friend  Aaron  Klug.  His  main 
interest  is  the  structural  organisation  in  biological  systems;  and  he  detected  in  the  bacterial 
flagellum  a  paradox  which  seemed  to  be  beyond  the  range  of  crystallography  or  chemistry  or 
thermodynamics;  but  which  he  thought  might  be  susceptible  to  an  attack  from  the  standpoint 
of  structural  mechanics  [5]. 

The  main  dimensions  of  a  bacterial  flagellar  filament  are  as  shown  in  the  sketch  of 
Figure  1.  The  filament  is  of  length  a  few  microns  (pm)  and  diameter  20  nm.  It  has  the  form 


Figure  1  (left).  Sketch  of  Salmonella  bacterium  "at  rest",  showing  the  cell  ("body"),  rotary 
motors  and  "normal"  flagellar  filaments. 

Figure  2  (right).  The  surface  lattice  for  a  "straight"  flagellar  filament,  to  a  much  larger  scale: 
this  is  an  "unrolled"  sheet  which  was  formerly  wrapped  around  a  short  piece  of  filament.  The 
position  of  each  subunit  is  marked  by  a  spot;  and  the  two  sides  join  at  the  open  spots. 
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of  a  "rigid"  and  uniform  helix,  typically  with  radius  0.2  pm  (-  200  nm)  and  pitch  2  pm.  The 
filament  is  built  from  a  single  kind  of  protein  building  block  or  subunit,  and  the  simple 
cylindrical  surface  lattice  of  the  filament  contains  near-longitudinal  1 1 -start  helices,  as  shown 
in  Figure  2.  This  and  other  information  about  the  packing-pattern  of  the  flagellin  molecules 
comes  from  electron  microscopy  by  image-reconstruction  techniques  [6];  while  data  on  the 
overall  helical  shape  comes  from  both  electron-  and  light-microscopes  [7,8]. 

The  paradox  perceived  by  Klug  is  this.  If  all  the  building  blocks  in  the  cylindrical  lattice 
were  in  a  strictly  equivalent  relationship  with  their  neighbours,  then  the  rod  would  be  straight. 
But  the  rod  is  actually  curved  into  a  helical  shape.  How,  then,  is  the  construction  arranged? 

Our  early  thinking  about  this  problem  suggested  that  the  solution  might  lie  in  the  idea  of 
some  sort  of  "bistable  switch"  between  two  states  [9],  either  within  the  subunit  or  at  an 
interface  or  connection-point  between  neighbouring  subunits.  (In  terms  of  the  mechanics  of 
the  complete  assembly,  there  can  hardly  be  a  difference  between  these  two  situations.)  At  the 
molecular  level,  a  bistable  feature  of  this  kind  had  already  been  found  within  the  four  parts  of 
the  haemoglobin  molecule  [10]. 

On  the  whole,  progress  was  slow  until  we  took  into  account  data  which  increasingly 
became  available  on  a  range  of  different  helical  forms  which  a  flagellar  filament  can  adopt  in  a 
variety  of  different  circumstances:  the  filament  is  polymorphic.  Briefly,  there  are  (i)  different 
natural  helical  shapes  in  different  mutants  of  salmonella  and  E.  coli.  bacteria  [7];  (ii)  different 
shapes  under  different  environments  -  such  as  different  acidity  and  salt  content  of  the 
surrounding  water  [11];  (iii)  different  forms  when  flagellin  monomers  from  different  strains 
are  mixed  and  co-polymerised  into  artificial  helical  filaments  [7];  and  (iv)  changes  in  helical 
shape  when  a  filament  is  subjected  to  torsional  stress  [12],  and  particularly  when  its  rotary 
motor  reverses  direction,  as  sketched  in  Figure  3  [13,14].  Furthermore,  it  turned  out  that  the 
"family"  of  distinct  helical  waveforms  which  can  be  built  under  all  of  these  different 
circumstances  had  a  limited  number  of  members,  as  shown  in  Figure  4:  the  same  helical 
forms  appeared  over  and  over  again  in  the  different  assays. 


Figure  3  (left),  (a)  In  smooth  swimming,  the  various  filaments  (cf.  Figure  1)  associate  to 
form  a  single  bundle,  (b)  Occasionally  the  rotary  motors  go  into  reverse,  and  a  new 
waveform  is  seen  at  the  distal  end.  These  sketches  were  taken  from  light-microscope  assays 
[14]. 

Figure  4  (right).  Sketch  of  the  family  of  discrete  helical  waveforms  of  flagellar  filaments 
which  are  made  by  copolymerisation  of  protein  monomer  subunits  from  two  strains  of 
bacteria  [1],  and  which  appear  also  in  other  assays. 
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Now  to  a  structural  engineer  investigating  the  building  of  a  helical  rod  in  a  curious  way, 
it  seemed  obvious  that  one  should  map  the  available  data  on  radius  and  pitch  of  the  various 
polymoiphic  helices  into  a  space  of  (uniform)  curvature  k  and  twist  x.  When  we  did  this  and 
took  account  of  the  helical  handedness  of  the  discrete  forms  (right-handed  (+)  or  left-handed 
(-))  [8]  we  found  that  the  points  corresponding  to  the  family  of  discrete  forms  lay  on  a  single 
arc  [2-4];  and  indeed  that  they  lay  close  to  an  array  of  12  points  equally  spaced  on  a  sinusoidal 
curve,  as  shown  in  Figure  5. 

At  this  stage  it  became  not  difficult  to  envisage  a  situation  in  which  the  subunits  -  which 
arc  represented,  conventionally,  by  featureless  spots  in  Figure  2  -  were  rectangular  in  shape, 
and  connected  at  their  comers  in  a  chess-board  fashion,  with  one  type  of  comer-connection 
having  a  bi-stable  feature,  as  shown  in  Figure  6(a)  [2,3].  The  different  helical  forms  could 
then  correspond  to  a  number  (from  0  to  11)  of  entire  longitudinal  strands  of  connections  being 
one  configuration  rather  than  the  other,  as  shown  in  Figure  6(b).  If  we  then  arranged  that  the 
chain  of  "1"  (or  B)  connections  was  slightly  longer  than  the  corresponding  chain  of  "0"  (or  A) 
connections,  then  the  different  numbers  of  "1"  connections  would  produce  different  amounts 
of  curvature  in  the  filament;  and  indeed,  provided  that  the  strands  of  "1"  connections  were  all 
bunched  together  in  one  portion  of  the  circumference  of  the  rod,  as  shown  in  the  examples  of 
Figure  6(b),  the  points  in  the  k,  x  plot  all  lay  on  a  single,  sinusoidal  arc  as  shown  in 
Figure  6(c).  The  points  are  equally  spaced  in  the  x  direction  because  each  additional  strand  of 
"1"  connections  introduces  an  equal  extra  twist  to  the  filament  overall,  as  explained  in  the 
caption  of  Figure  6(b). 

The  way  in  which  the  strands  of  "1"  connections  produce  curvature  may  be  seen  from  a 
simplified  "lion's  cage"  model  [2],  shown  in  Figure  7,  in  which  two  rigid  discs  are  separated 
by  1 1  parallel  elastic  bars,  some  of  which  (in  the  ”  1  ”  state)  are  slightly  longer  than  the  others 
("0"  state)  when  they  are  relaxed.  It  is  an  elementary  matter  to  show  that  the  angle  of  tilt 
between  the  discs  -  which  corresponds,  of  course,  to  curvature  in  the  filament  -  is 
proportional  to  sin(mt/ll),  where  n  is  the  number  of  bars  in  the  longer,  "1"  configuration. 
The  complete  range  of  12  possible  states  includes  two  straight  forms  (n  =  0,  n  =  11):  one  of 
these  has  left-handed  twist  and  the  other  right-handed  twist. 
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Figure  5,  Twist-curvature  plot  of  helical  forms  observed  in  homopolymers  from  different 
strains  or  mutants  of  bacteria  (•)  and  copolymers  of  subunits  from  two  different  strains  (cf. 
Figure  4)  (o).  Arbitrary  units  for  curvature  and  twist.  Point  X  on  the  right  corresponds  to  a 
mutant  with  straight  flagellar  filaments  having  right-handed  twist,  as  in  Figure  2  [6];  and  point 
X  on  the  left  corresponds  to  another  "straight"  mutant,  discovered  later,  with  filaments  having 
left-handed  twist. 


Figure  6.  (a)  Proposed  near-rectangular  subunit,  showing  inter-connections  with  identical 
neighbours.  The  rule  of  connection  is  that  C  joins  to  E,  and  D  joins  either  to  A  or  the 
neighbouring  point  B.  (b)  Four  possible  packing  arrangements  in  the  surface  lattice  (cf. 
Figure  2),  in  which  the  subscript  indicates  the  number  of  near- longitudinal  (vertical)  strands  of 
connections  in  the  longer  "1"  configuration,  corresponding  to  connections  B  in  (a).  The 
horizontal  lines  are  circumferential.  Note  how  the  inclination  of  the  near-longitudinal  columns 
of  subunits  with  respect  to  the  circumference  changes  as  the  number  of  "  1 "  strands  increases, 
as  a  direct  consequence  of  the  geometry  of  packing;  so  that  the  overall  twist  of  the  filament 
changes  in  proportion  to  the  number  of  "1"  strands,  (c)  Curvature  (K),  twist  (t)  plot  for  the 
twelve  theoretical  packing  states  (of  which  four  are  shown  in  (b))  together  with  the  helical 
shape  (for  a  given  contour  length)  of  the  corresponding  waveforms.  The  geometrical 
parameters  of  the  model  have  been  chosen  to  fit  the  experimental  observations  from  Figure  5 
and  from  other  sources,  which  are  marked  by  open  symbols. 
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In  principle,  of  course,  there  could  be  several  different  configurations  with,  say,  n  =  9 
sets  of  longitudinal  strands  in  the  "1"  configuration  and  two  sets  in  the  ”0"  configuration. 
Why,  then,  have  we  chosen  only  one,  as  shown  in  Figures  6(b)  and  7,  in  which  these  two  are 
bunched  together  on  one  side  of  the  filament?  It  is  clear  that  such  a  "bunching"  feature  will 
make  the  range  of  distinct  states  lie  on  the  single  arc  as  in  Figure  6:  but  can  we  adduce  any 
physical  reason  for  this  restriction? 

A  plausible  answer  is  that  if  the  two  slightly  shorter  rods  in  the  model  of  Figure  7  were 
not  adjacent  to  each  other  in  a  single  group,  then  the  elastic  strain  energy  would  be  greater.  It 
is  straightforward  to  obtain  this  result;  and  indeed  a  similar  result  holds  for  all  values  of  n  [2]. 

Let  us  now  look  more  closely  at  die  "chess-board"  model  of  the  arrangement  of 
subunits.  The  essential  feature  of  the  chess-board  scheme  is  that  it  provides  a  way  of 
disposing  subunits  on  the  known  surface  lattice  while  providing  for  longitudinal  strands  of 
connection  which  can  give  the  short/long  "0/1"  feature  that  is  needed  for  curvature.  It  is 
crucial,  of  course,  to  the  model  that  the  "0/1"  pattern  should  propagate  unchanged  along  the 
entire  length  of  a  filament;  and  again  an  elastic  calculation  -  though  admittedly  on  a  plane  grid 
of  elastic  units  -  shows  that  the  "0/1"  switch  will  not  propagate  in  any  direction  of  the  surface 
lattice  (Figure  2)  other  than  the  near-longitudinal  1 1-start  direction  [3]. 

Some  features  of  the  structural  model  are  critical,  while  others  are  not.  Thus,  the 
general  layout,  with  a  single  "0/1"  bistable  feature  for  each  subunit,  and  an  otherwise  simple 
elasticity,  are  all  crucial.  But  it  is  probably  not  essential  to  have  the  subunits  planar,  for 
example.  We  have  also  assumed  tacitly  that  inter- unit  contacts  are  of  the  "frictionless  spherical 
joint"  variety.  This  is  not  unreasonable  as  a  picture  of  connections  between  adjacent  protein 
molecules;  but,  again,  it  is  not  an  essential  feature. 

Some  may  object,  of  course,  that  our  flat  subunits  do  not  look  at  all  like  protein 
molecules.  The  key  point  here  is  that  our  unit  is,  strictly,  a  model  for  the  proper  spatial 
connection  between  the  four  points  at  which  the  molecule  connects  to  its  neighbours. 

We  were  led  to  use  the  scheme  of  four  connecting  points  for  each  sub-unit  by  the 
following  reasoning;  which  turned  out  to  be  incorrect,  as  we  shall  see.  (There  must,  of 
course,  be  an  even  number  of  connecting  points  per  subunit  if  two  units  connect  at  each  joint.) 


Figure  7  (left).  "Lion's  cage"  model  to  explain  how  curvature  of  the  filament  depends  on  the 
number  of  longitudinal  strands  of  connections  in  the  'T*  (long)  and  ”0"  (short)  configurations. 
Here  the  strands  are  represented  by  rods  which  are  bistable  in  their  relaxed  form  -  as  shown  - 
but  are  otherwise  linear-elastic.  When  the  two  short  rods  are  connected  up  to  the  discs,  the 
discs  tilt  relative  to  each  other,  corresponding  to  curvature  in  the  filament. 

Figure  8  (right).  Six-connected  version  of  the  model  subunit,  in  which  the  four-connected 
"surface"  unit  of  Figure  6(a)  is  equipped  with  an  inward-facing  extension  which,  inter  alia, 
ensures  that  the  subunits  pack  in  a  unique  surface  lattice. 
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Thus,  if  one  rigid  block  is  fixed,  and  another  rigid  block  is  then  connected  to  it  by  means  of  a 
spherical  joint,  the  act  of  coupling  removes  three  degrees  of  freedom  from  the  second  block: 
for,  whereas  it  originally  had  6  independent  degrees  of  freedom  in  3-space,  it  now  has  only  3, 
in  the  form  of  3  components  of  rotation.  Hence  we  may  say  that  a  spherical  joint  removes  1 .5 
degrees  of  freedom  per  connecting  block.  Therefore  a  block  with  four  connecting  points  loses 
a  total  of  6  degrees  of  freedom  when  it  is  connected  up;  and  so  it  becomes  fixed  in  space,  and 
part  of  a  rigid  structure. 

This  argument  is  almost  entirely  satisfactory  provided  the  blocks  are  built  into  the  correct 
surface  lattice.  But  in  fact  the  four-connected  subunits  of  the  type  which  we  are  discussing 
can  clearly  be  built  into  an  arbitrary  cylindrical  surface  lattice,  and  not  only  the  specific  lattice 
of  Figure  2  in  which  we  are  interested.  So  what  determines  a  specific  surface  lattice?  The 
only  acceptable  answer  to  this  question  is  the  subunits  themselves.  The  details  of  the  surface 
lattice  must  be  a  consequence,  somehow,  of  the  geometrical  design  of  the  «ubunit:  and  the 
subunits  can  then  self-assemble  into  the  correct  packing  pattern. 

The  extra  geometrical  constraints  that  produce  the  proper  global  packing  pattern  can  be 
provided  by  the  addition  of  two  further  points  of  connection  with  neighbouring  units.  A 
possible  arrangement  is  shown  in  Figure  8  [4],  It  is  clear  that  the  total  number  of  units 
around  the  circumference  of  the  cylinder  will  be  determined  by  the  geometrical  layout  of  the 
inward-projection;  and  indeed  the  overall  twist  of  the  assembly  to  ensure  correct  closure  is 
also  fixed  by  the  additional  geometrical  constraints. 

The  scheme  which  I  have  outlined  here  enables  us  to  see,  in  principle,  how  a  single  kind 
of  subunit  can  build  only  into  one  particular  surface  lattice  with,  indeed,  a  specific  number  of 
longitudinal  strands  of  connections  in  the  "1"  configuration:  the  assembly  simply  will  not  fit 
together  in  any  other  way.  And  a  very  small  change  in  the  geometry  of  the  subunit  would 
make  the  units  assemble  only  into  another,  specific  member  of  the  discrete  family  of 
waveforms. 

There  is  not  much  point  in  pursuing  a  detailed  analysis  of  the  small  changes  in  geometry 
of  the  subunit  which  would  effect  changes  between  particular  helical  forms  within  the  discrete 
range  of  available  states.  The  crucial  point  is  that  the  changes  are  small;  and  moreover  they  are 
of  the  kind  which  would  be  produced  in  the  actual  protein  by  a  variety  of  means,  such  as  a 
change  of  hydrogen  activity,  or  a  mechanical  torque.  And  it  is  not  difficult  to  see  that  a 
copolymerisation  of  two  different  monomers,  each  of  which  by  itself  would  construct  a 
particular  member  of  the  family  of  distinct  helical  waveforms,  could  construct,  between  them, 
an  intermediate  member  of  that  family. 

In  summary,  we  have  here  a  "mechanical"  model  which  produces  all  of  the  main  aspects 
of  the  known  behaviour  of  the  bacterial  flagellar  filament.  The  essential  features  of  the  design 
are  clear,  but  much  of  the  detailed  geometry  of  the  subunit  is  still  uncertain.  The  data  now 
available  on  the  detailed  shape  of  a  subunit  [  15-17]  are  much  better  than  they  were  in  the  mid 
1970s,  when  the  work  which  I  have  described  was  done;  but  they  are  still  insufficient  for  the 
construction  of  a  physically  accurate  model.  It  seems  likely  that  new  experimental  techniques 
will  enable  some  progress  to  be  made  on  this  problem  within  the  next  few  years.  A 
particularly  interesting  recent  result  is  that  the  part  of  the  flagellin  molecule  which  is 
responsible  for  the  special  scheme  of  construction  of  the  filament  occupies  only  a  small 
fraction  of  the  molecule,  at  the  two  ends  of  the  protein  chain  [18]. 


3.  STRUCTURAL  MECHANICS  OF  DNA  [19-23] 

It  has  been  well  known  for  nearly  40  years  that  DNA,  the  molecule  within  every  cell  of 
our  bodies,  that  stores  our  individual  genetic  "design  blueprint",  has  a  double-helical 
structure,  as  shown  schematically  in  Figure  9.  Attached  to  each  of  the  helical  "backbones"  or 
strands  are  chemical  bases  of  four  different  kinds  -  A,  T,  G,  C  -  which  carry  the  genetic 
information  in  three-letter  code-words.  The  bases  form  pairs  in  a  cross-chain  sense;  and  since 
things  are  so  arranged  that  A  always  pairs  with  T  and  G  with  C,  as  in  Figure  9,  the  second 
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strand  carries  the  complement  of  the  code-message  of  the  first.  The  helical  strands  are  made 
from  sugar-phosphate  units,  and  their  degree  of  intertwining  is  such  that  there  are  about  10 
base-pairs  in  one  double-helical  turn  of  DNA. 

When  a  cell  is  about  to  divide  into  two  cells  -  this  being  the  basic  scheme  for  growth  or 
construction  in  all  living  organisms  -  the  DNA  has  to  be  duplicated,  so  that  a  complete  and 
accurate  copy  of  it  can  be  incorporated  in  each  daughter-cell.  This  is  achieved  in  the  cell  by 
first  separating  the  two  strands  from  each  other  and  then  assembling  onto  each,  from  available 
"spare  parts",  the  missing  complementary  strand. 

All  of  this  is  well-known,  high-school  science.  What  is  perhaps  not  so  well-known  is 
that  the  DNA  thread  in  any  "chromosome"  or  packaging  unit  in  the  cell  is  extremely  long  in 
comparison  with  its  width:  its  aspect  ratio  is  typically  about  1  x  107.  Thus,  if  wc  were  to 
enlarge  a  piece  of  DNA  by  a  linear  factor  of  106,  it  would  be  like  a  piece  of  string  2  mm  in 
diameter  and  about  40  km  long.  On  the  same  scale,  the  diameter  of  a  single  cell  would  be 
about  10  m.  We  have  to  conclude  that  there  is  a  colossal  packaging  problem,  to  get  the  DNA 
thread  bundled  up  into  the  available  space,  and  without  tangling. 

Nature  uses  a  hierarchy  of  packing  procedures  to  achieve  this  massive  degree  of 
compaction.  The  details  are  rather  complicated;  but  for  present  purposes  it  suffices  to  say  that 
the  DNA  thread  in  the  cell  is  wrapped  round  a  series  of  "bobbins"  made  from  special  proteins 
called  histones.  The  DNA  wraps  twice  round  each  "histone  spool"  in  a  left-handed 
superhelical  sense,  as  shown  schematically  in  Figure  10.  There  are  about  80  base-pairs  in 
each  turn  of  DNA  around  the  spool.  The  picture  of  Figure  10  is  not  drawn  to  scale:  in  fact, 
the  total  volume  of  DNA  in  the  arrangement  is  approximately  equal  to  the  total  volume  of 
histones. 


Figure  9.  Double-helical  structure  of  DNA:  a  schematic  view  showing  the  "anti-parallel" 
directions  of  the  two  chains,  and  bases,  base-pairs  and  base-pair  steps. 


Figure  10.  DNA  wrapped  around  histone  spools. 


I  have  used  the  word  "superhelical"  here  to  describe  this  wrapping  of  DNA  around  a 
spool  to  distinguish  the  path  of  the  DNA  thread  from  the  right-handed  helical  twisting  of  the 
two  strands  about  each  other,  on  a  much  smaller  scale,  within  the  DNA  thread  itself. 

Before  we  can  proceed  further,  I  must  explain  some  simple  labelling  conventions.  Each 
of  the  sugar-phosphate  chains  has  a  distinct  sense  of  direction,  which  can  be  seen  directly 
from  the  atoms  in  "all-atom"  pictures  (cf.  Figure  12,  below)  but  which  has  to  be  shown  by  an 
arrow  in  a  schematic  drawing  like  that  of  Figure  9.  The  sequence  of  DNA  is  described  by 
giving  a  letter-by-letter  description  of  the  bases  along  one  strand,  in  the  conventional  direction. 
Later  on  we  shall  discuss  double-helical  DNA  as  an  assembly  of  base-pair  steps  ("dinucleotide 
steps").  For  example,  base-pair  steps  TC  (o),  CG  (•),  AA  (A)  and  AC  (A)  may  be  seen  on 
the  strand  which  runs  from  left  to  right  in  Figure  9.  These  same  four  steps  would  be  read  -  in 
the  opposite  order  -  as  GT  (A),  TT  (A),  CG  (•)  and  GA  (o)  with  reference  to  the  other 
strand.  Thus  the  step  TC  (o)  may  be  described  alternately  as  TC  or  GA,  while  the  step  A  may 
be  described  alternatively  as  and  TT  or  AA;  but  step  CG  (•)  reads  the  same  with  respect  to 
either  strand.  Altogether,  there  are  10  different  kinds  of  base-pair  step,  according  to 
sequence. 

Two  further  points  need  to  be  made  here.  The  first  is  that  A  and  G  bases  ("purines")  are 
larger  than  T  and  C  bases  ("pyrimidines").  Inis  difference  will  play  a  part  in  the  structural 
mechanics  of  DNA,  as  we  shall  see.  The  second  is  that  modem  analytical  technology  makes  it 
a  relatively  straightforward  matter  to  determine  the  precise  sequence  of  a  given  piece  of  DNA  a 
few  hundred  base-pairs  long. 

3.1.  Some  questions  about  DNA 

I  propose  now  to  discuss  some  questions  about  DNA  for  which  applied  mechanics  can 
help  to  provide  some  answers.  I  shall  necessarily  be  brief;  but  I  hope  to  be  able  to  give 
sufficient  detail  for  the  discussion  to  be  comprehensible.  Full  details  may  be  found  in  the 
book  which  my  collaboratoi  Horace  Drew  and  I  have  written  recently  [23], 

3.2.  Why  is  DNA  double-helical? 

If  DNA  were  stretched  out  to  make  a  plane  "ladder",  there  would  be  gaps  between 
successive  base-pairs,  because  the  bases  are  3.3  A  thick  (1  A  =  1CH°  m  =  0.1  nm)  whereas 
the  standard  length  of  the  segments  of  connecting  chain  is  about  6  A.  One  step  of  such  an 
artificially  straight  ladder  is  shown  schematically  in  Figure  1 1(a). 

The  DNA  molecule  is  surrounded  by  water  in  the  cell.  Now  the  bases  are  hydrophobic, 
and  thus  they  hate  to  have  water  sandwiched  between  them.  The  twisting  of  the  plane  ladder 
into  a  double-helix  -  shown  for  one  step  in  Figure  1 1(b)  -  thus  allows  the  bases  to  stack 
directly  onto  each  other,  while  the  6  A  chain-lengths  adopt  a  spiral  shape  so  that  they  climb  by 
only  3.3  A  per  step.  A  simple  geometrical  calculation,  based  on  the  dimensions  shown  in 
Figure  1 1(a),  indicates  an  angle  of  helical  twist  in  (b)  of  about  34',  which  is  typical  of  actual 
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Figure  11.  (a)  A  base-pair  step  of  an  artificially  straightened  "ladder"  of  DNA.  (b)  The 
same,  but  with  the  base-pain  arranged  with  helical  twist,  so  that  they  can  stack  directly  onto 
each  other. 
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DNA.  (Such  a  calculation  is,  however,  too  crude  to  predict  whether  the  double  helix  will  be 
left-  or  right-handed.) 

This  picture  of  the  double-helical  structure  of  DNA  leads  to  an  important  point.  The 
sugar-phosphate  chains  are  often  described  as  "backbones"  in  textbooks,  as  if  they  were 
providing  some  rigid  support  to  the  base-pairs  -  as  if  to  the  treads  of  a  spiral  staircase.  In 
fact,  the  structure  is  controlled  by  the  base-stacking  arrangements;  and  the  flexible  chains  have 
to  adjust  their  positions  accordingly.  These  chain-links  provide  some  constraint,  of  course,  to 
the  relative  positions  which  can  be  adopted  by  the  stacking  bases;  but  they  are  not  in 
themselves  the  primary  structural  element  of  the  double-helix. 

3.3.  Why  is  DNA  wrapped  around  the  histone  spools  in  a  left-handed 
fashion? 

When  the  coded  message  of  DNA  is  being  read  by  the  machinery  within  the  cell,  it  is 
necessary  for  the  two  sugar-phosphate  strands  to  come  apart,  so  that  the  special  "reading- 
enzymes"  can  gain  access  to  the  individual  bases  on  one  particular  strand  of  the  DNA. 

Left-handed  wrapping  of  the  DNA  around  the  spools  is  helpful  in  this  respect.  Thus,  if 
we  remove  a  spool  and  pull  the  now  naked  segment  of  DNA  out  straight,  the  DNA  becomes 
strongly  untwisted  if  the  wrapping  around  the  spool  was  in  a  left-handed  sense.  You  can 
easily  demonstrate  this  effect  yourself,  by  wrapping  some  electric  power  cord  around  a  beer 
can,  removing  the  beer  can,  and  pulling  the  cord  out  straight  [24,25,23], 

Thus  Nature  provides  a  neat  and  simple  way  of  untwisting  DNA  when  this  is  needed, 
as  the  first  stage  in  the  process  of  separating  the  two  strands.  In  general,  of  course,  it  is  not 
an  easy  matter  to  untwist  a  piece  of  string:  there  is  a  difficult  problem  in  gripping  the  filament 
strongly  enough.  The  arrangement  in  which  the  DNA  goes  in  a  left-handed  sense  round  the 
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Figure  12  (left).  All-atom  representations  of  DNA  in  the  classical  ”B"  and  "A"  forms, 
showing  two  double-helical  turns. 

Figure  13  (right).  Rigid-block  representations  of  the  base-pairs  of  DNA  in  the  classical  "B" 
and  "A"  forms.  The  sugar-phosphate  chains  are  not  shown,  in  order  to  display  the  geometry 
of  the  base-pair  steps;  and  only  one  helical  turn  is  shown.  (The  scale  is  different  from  that  of 
Figure  12.) 


removable  histone  spools  provides  a  way  of  gripping  and  untwisting  the  DNA  -  in  addition, 
of  course,  to  achieving  an  overall  compaction  in  length  of  the  DNA. 

3.4.  How  does  DNA  change  from  the  "B"  to  the  "A"  form? 

For  this  third  question  we  must  go  back  almost  forty  years,  to  the  time  when 
Rosalind  Franklin  obtained  the  first  x-ray  diffraction  pictures,  from  samples  of  random- 
sequence  DNA  which  had  been  prepared  by  pulling  a  tangled  mass  of  DNA  threads  out  into 
fibres:  this  procedure  aligned  the  individual  pieces  of  DNA  with  respect  to  each  other,  and 
allowed  the  x-ray  method  to  reveal  the  mean  or  average  configuration  of  the  DNA  molecule 
[26]. 

This  exercise  produced  two  distinct  diffraction  patterns:  "A"  when  the  fibre  was  dry, 
and  "B"  when  it  was  wet.  When  these  diffraction  patterns  had  been  analysed  properly,  so  that 
the  detailed  structure  could  be  discerned,  the  two  corresponding  kinds  of  DNA  were  both 
right-handed  double  helices:  but  there  were  some  clear  differences  between  them,  as  may  be 
seen  from  Figure  12.  Thus,  relative  to  the  "B"-form,  a  piece  of  "A"-form  DNA  having  the 
same  number  of  base  pairs:  (i)  is  shorter,  (ii)  is  fatter,  (iii)  has  its  bases  tilted  with  respect  to 
the  overall  axis,  and  (iv)  has  about  10%  less  helical  twist  per  base  pair  [27).  The  first  three  of 
these  features  can  be  seen  clearly  in  Figure  12. 

For  many  years  it  was  not  at  all  clear  what  mediated  the  "B"-to-"A"  transition  when  the 
water-activity  was  reduced.  Usually  the  "backbone"  was  adduced  as  being  somehow 
responsible  -  even  though  it  could  hardly  explain  the  well-established  fact  that  DNA  having 
specific  base  sequences  switched  more  readily  to  the  "B"  form  with  increasing  water  activity 
than  DNA  having  other  sequences. 

As  soon  as  we  accept  the  idea  that  the  double-helical  structure  of  DNA  is  driven  by  the 
arrangements  for  stacking  the  bases  on  top  of  each  other,  we  can  begin  to  see  how  the 
transition  between  the  two  forms  comes  about.  What  we  need  is  an  interpretation  of  the  two 
pictures  in  Figure  12  from  the  point  of  view  of  the  stacking  arrangements  within  a  typical  base 


Figure  14.  Definition  of  the  "roll"  and  "slide”  relative  motions  at  a  base-pairs  step:  "helicai 
twist”  is  shown  in  Figure  1 1(b). 


Figure  15.  Propeller-twist  in  two  base-pairs.  The  bases  are  shown  here  as  if  they  were  all  of 
the  same  size. 
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step.  This  is  shown  in  Figure  13,  where  the  base  pairs  have  been  drawn  as  rigid  blocks,  and 
the  sugai  phosphate  chains  have  been  omitted  entirely  [19].  At  this  point  we  need  to  introduce 
a  technical  definition.  The  so-called  "minor- groove  side"  of  the  DNA  has  been  marked  in 
Figure  13  by  marking  the  edges  of  the  blocks  in  black.  (See  also  the  shading  in  Figure  11, 
which  serves  the  same  purpose.)  This  feature  can  also  be  seen  in  Figure  12:  the  "groove”  is 
the  space  between  the  sugar-phosphate  chains. 

To  go  from  the  characteristic  "B"  step  to  the  characteristic  "A"  step  we  need  simply  to 
apply  both  the  "roll"  and  "slide"  internal  motions,  as  defined  in  Figure  14.  These  are  in  fact 
two  of  the  six  degrees  of  freedom  which  one  rigid  block  has,  in  general,  with  respect  to  its 
neighbour;  and  of  the  six  these  two  are  the  least  constrained  by  the  combined  effects  of  the 
stacking  arrangements  and  the  sugar-phosphate  links.  A  third  degree  of  freedom,  helical 
twist,  is  what  we  have  already  seen  in  Figure  1 1(b).  The  transition  from  the  "B"  to  the  "A” 
form  requires,  on  average,  a  change  in  roll  angle  of  12°,  and  a  change  in  slide  of  about  3  A. 

In  recent  years  it  has  become  possible  to  examine  the  stacking  arrangements  within  the 
base-pair  steps  of  DNA  to  atomic  resolution  by  means  of  x-ray  analysis  of  single  crystals  that 
have  been  made  from  very  pure  samples  of  short  pieces  of  DNA  having  identical  sequences. 
Such  structures  are  almost  always  rather  irregular,  with  strikingly  different  geometries  in 
individual  base-pair  steps,  although  the  double-helical  structure  appears  to  be  uniform  to  first- 
order  [28,29].  An  almost-universal  feature  which  emerges  from  such  studies  is  that  the  base- 
pairs  are  not  actually  planar,  but  have  "propeller  twist”  [28,30].  This  is  sketched  in 
Figure  15.  It  can  be  seen  from  Figure  16  that  if  two  propeller-twisted  base-pairs  are  stacked 
onto  each  other,  then  geometry  requires  that  the  "slide"  motion  will  be  directly  linked  to  the 
"roll"  motion.  And  this  is  precisely  the  arrangement  which  we  have  already  described  as 
being  characteristic  of  the  "B"-to-"A”  transition. 

What,  then,  "drives"  the  transition?  The  explanation  is,  broadly,  that  in  "wet" 
conditions,  the  hydrophobic  forces  make  the  base-pairs  "cover"  each  other  as  well  as  possible, 
thereby  giving  low  "slide",  as  in  Figure  16(a).  But  in  "dry"  conditions  the  hydrophobic  effect 
is  weaker,  and  other  stacking-forces  push  the  base-pairs  into  an  "offset"  or  high-slide 
position,  as  in  Figure  16(b).  (Unfortunately,  these  pictures  are  too  crude  to  show  this  effect 
convincingly.) 

It  is  not  obvious  from  what  I  have  said  so  far  that  the  ”A"-to-"B"  transition  will  take 
place  as  a  clear  "switch"  -  as  it  does  in  practice,  when  the  water  activity  is  steadily  reduced  - 
rather  than  in  a  "smooth"  fashion.  The  key  to  the  situation  is  that  in  double-helical  structures 
which  have  been  studied  by  single-crystal  x-ray  methods,  certain  kinds  of  base-pair  step, 
particularly  of  the  kind  Pyr-Pur  (Pyr  =  pyrimidine,  i.e.  A  or  G;  Pur  =  purine,  i.e.  T  or  C) 
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Figure  16.  Two  distinct  stacking-arrangements  of  a  base-pair  step,  showing  the  coupling 
between  roll  and  slide  on  account  of  propeller-twist.  The  sugar-phosphate  chains  run  in  the 
”5’  -»  3’ "  direction  (cf.  Figure  9),  so  the  smaller  (Pyrimidine)  h*ce  precedes  the  larger 
(Purine)  base  on  each  chain. 


adopt  either  a  high-slide,  high-roll  or  a  low-slide,  low-roll  configuration;  but  not  an 
intermediate  form.  This  is  the  situation  shown  in  Figure  16.  It  seems  that  it  is  base-pair  steps 
of  this  kind  which  produce  the  "switch"  feature  of  the  transition  in  a  tract  of  DNA.  Other 
types  of  step  prefer  a  fixed  configuration:  for  example,  AA  steps  always  have  a  low-roll,  low- 
slide  shape;  and  thus  they  are  reluctant  to  move  over  into  the  ”A"-form. 

3.5.  How  does  DNA  curve  around  a  histone  spool? 

The  bending  of  DNA  is  more  complicated  than  the  bending  of  an  ordinary  elastic  rod  for 
two  main  reasons.  First,  it  is  clear  that  the  helical  structure  of  the  DNA  will  play  an  important 
part  in  bending.  Second,  the  relative  rotations  at  base-pair  step  level  which  are  necessary  for 
changes  in  curvature  in  DNA  can  occur  most  easily  by  means  of  ’’roll"  within  the  individual 
base-pair  steps. 

Figure  17(a)  shows  schematically  a  piece  of  DNA  which  has  been  bent  into  a  curve  of 
about  4.5*  per  step,  on  average  [20],  (This  is  precisely  the  degree  of  curvature  required  for 
bending  DNA  around  a  histone  spool:  360°  in  80  steps.)  The  curvature  is  produced  by 
positive  roll  in  some  base-pair  steps,  but  negative  roll  in  others  which  are  half-a-tum  of  helical 
twist  away.  Simple  geometry  shows  that  curvature  requires  a  periodic  variation  of  roll,  at  a 
period  equal  to  the  helical  repeat  of  the  DNA.  In  Figure  17(a)  the  roll  angles  vary 
sinusoidally  with  position  along  the  molecule.  A  different  way  of  achieving  the  same  overall 
curvature  is  to  have  alternating  patches  of  "B"-form  and  "A"-form  DNA  (as  in  Figure  13), 
with  the  pattern 


location  of  base  pair  step  on  histone  spool 


Figure  17  {left).  Two  double-helical  turns  bent  through  a  90*  curve  (a)  by  a  smooth  variation 
of  roll  angles  and  (b)  by  step-wise  variation  of  roll. 

Figure  18  {right).  Total  number  N  of  (a)  AA  =  TT  and  (b)  GC  base-pair  steps  in  (many  pieces 
of)  curved  chicken  DNA,  plotted  against  location  on  the  histone  spool  (cf.  Figure  10).  The 
overall  length  of  the  DNA  is  145  base-pairs,  but  the  data  were  symmetric,  and  so  could  be 
plotted,  as  shown,  from  end  to  centre. 
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repeating  every  10  base  pairs,  as  in  Figure  17(b).  Plainly,  there  is  no  unique  way  of 
imparting  a  given  curvature  to  DNA. 

Now  if  a  piece  of  ondinarily-straight  DNA  is  to  be  curved  tightly  around  a  histone  spool, 
it  is  necessary  for  changes  of  roll  to  occur  at  some  of  the  base-pair  steps  along  the  mo'  ecule. 
It  should  be  clear  from  previous  remarks  that  some  specific  types  of  base-pair  steps  prefer  a 
low-roll  position,  while  others  prefer  a  high-roll  position;  and  that  others  readily  adopt  either  a 
low-roll  or  a  high-roll  configuration. 

Figure  18  shows  the  results  of  sequencing  pieces  of  DNA  from  chicken  blood  cells 
which  are  known  to  have  been  wrapped  twice  round  histone  cores  [31].  More  than  one 
hundred  distinct  145-base-pair  lengths  of  DNA  of  this  kind  were  sequenced,  and  statistical 
analyses  were  performed  on  the  positional  preferences  for  various  kinds  of  base-pair  step. 
Without  going  into  much  detail,  we  can  see  immediately  that  the  frequency  of  occurrence  of 
steps  AA  and  steps  GC  at  different  positions  around  the  circumference  of  the  spool  varies 
periodically  with  position.  In  both  cases  the  periodicity  is  about  10  base  pairs;  i.e.  it 
corresponds  to  the  helical  repeat  of  the  DNA.  But  closer  examination  shows  that  AA  steps 
prefer  to  occupy  "low-roll"  positions  -  where  the  minor-groove  side  of  the  DNA  faces 
inwards  -  while  GC  steps  prefer  "high-roll"  positions. 

Time  does  not  permit  me  to  go  through  the  details  of  this  substantial  assay;  but  I  may 
remark  that  the  positional  preferences  of  this  kind  that  are  shown  by  particular  dinucleotide 
steps  are  in  general  accord  with  configurations  that  are  found  in  single-crystal  assays  on  the 
same  types  of  step. 

3.6.  How  do  proteins  recognise  particular  sequences  of  DNA? 

There  are  many  biological  processes  in  which  a  particular  protein  has  to  search,  and 
find,  and  attach  itself  to  a  specific  sequence  -  such  as  ACAATATATATTGT  -  in  a  very  long 
piece  of  DNA.  Such  a  sequence,  for  example,  might  mark  the  start  of  a  gene. 

Recognition  processes,  whether  between  protein  and  protein  or  protein  and  DNA,  have 
a  strong  geometrical  component.  Thus,  for  strong  oinding  or  attachment  of  one  to  the  other 
there  must  be  not  only  a  good  geometric  fit  but  also  an  arrangement  of  atoms  within  the  mating 
surfaces  which  allow  the  formation  of  a  sufficient  number  of  specific  hydrogen  bonds.  In  this 
way,  particular  proteins  can  find  and  bind  to  specific  sequences  of  DNA  by  what  is  usually 
called  a  "direct  reading"  process. 

But  there  is  also  a  second,  different  element  in  the  recognition  process,  which  has  only 
recently  been  elucidated.  The  binding  of  "434  repressor"  to  the  14  base  pair  DNA  sequence 
ACAATXXXXATTGT  provides  a  good  example  [32]:  here  X  denotes  a  variable  base. 

For  binding  to  take  place,  the  two  5-letter  end-sequences  of  the  DNA  must  be  exactly 
as  given  above.  But  it  turns  out  that  a  rather  wide  variation  of  sequence  is  tolerable  within  the 


Figure  19.  Schematic  view  of  the  binding  of  434  repressor  protein  to  DNA,  showing  a  central 
"bend"  of  about  four  base-pairs  which  are  not  in  direct  contact  with  the  protein. 


central  four-base  region.  And  indeed  the  strength  of  binding  depends  s)gnificantly  on  the 
DNA  sequence  of  this  somewhat  variable  portion. 

Figure  19  shows  a  sketch  of  the  binding  arrangement.  The  two  end  regions  of  the 
DNA  are  tightly  bound  to  the  protein  by  "direct  reading"  recognition.  Notice  that  the  shape  of 
the  protein  is  such  that  the  central  portion  of  DNA  has  to  bend  through  about  20'  in  order  to 
fit.  Now  we  have  seen  above  that  different  dinucleotide  steps  have  different  positional 
preferences  in  DNA  that  is  curved;  and  it  turns  out  that  there  is  an  excellent  correlation 
between  the  strength  of  binding  on  the  one  hand  and  the  position  preferences  -  as  determined 
from  the  chicken  blood  cell  DNA  wrapped  around  histone  spools  (cf.  Figure  18)  - 
corresponding  to  the  sequence  of  the  central  region  in  the  434  recognition  sequence. 


4.  CONCLUSION 

In  this  lecture  it  has  been  appropriate  to  present  the  various  thoughts  and  ideas  by  means 
of  words  and  pictures.  In  particular,  I  have  not  employed  any  mathematical  symbolism, 
although  in  a  full  account  (e.g.  {23]),  there  would  be  plenty  of  mathematics.  (For  example,  a 
full  treatment  of  base-pair  stacking  in  a  piece  of  DNA  clearly  requires  careful  computation  of 
finite  angular  rotations.) 

In  fact,  the  main  task  in  each  of  the  two  problems  which  I  have  described  has  been  to  set 
up  a  suitable  conceptual  reference -frame  for  thinking  about  the  structural  mechanics  of  the 
situation.  As  I  have  tried  to  demonstrate,  the  resulting  models  enable  us  to  explain,  within  the 
framework  of  structural  mechanics,  a  rather  wide  r  nge  of  physical  enomena. 

The  thought-processes  which  lead  to  these  appropriate  models  have  involved  structural 
mechanics  at  a  most  elementary  level.  Thus,  for  example,  in  each  of  the  two  problems  we 
have  used,  in  different  ways,  the  primitive  observation  that  a  rigid  body  has  six  degrees  of 
freedom  in  three-dimensional  Euclidean  space.  It  is  perhaps  rare  nowadays  to  Find  problems 
where  the  key  to  the  solution  lies  in  such  a  simple  and  elementary  idea.  But  we  have,  of 
course,  also  employed  other  considerations  than  those  of  elementary  kinematics.  For  the  most 
part  these  have  involved,  explicitly  or  implicitly,  some  linearisation  of  the  various  equations 
and  an  invocation  of  elementary  elasticity,  coupled  with  some  simple  ideas  of  mechanical 
bistability.  The  fact  that  the  various  models  produce  satisfactory  results  in  spite  of  ample 
linearisations  suggests  that  the  underlying  ideas  are  both  robust  and  appropriate  to  the 
problems  under  investigation.  And  indeed,  we  may  perhaps  put  both  of  our  problems  into  the 
category  of  those  that  are  solved  in  the  first  instance  by  a  judicious  amount  of 
oversirrplification.  It  remains  to  be  seen,  of  course,  whether  there  are  many  more  problems  in 
molecular  biology  which  will  yield  to  the  same  mode  of  attack. 
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Viscous  fingering  as  a  pattern  forming  system 

Y.  Couder 

Laboratoire  de  Physique  Statistique  de  i'Ecole  Normale  Superieure 
24  rue  Lhomond,  75231  Paris  Cedex  05.  France 


The  conference  I  gave  in  Haifa  corresponded  very  closely  to  a  review  article 
that  I  had  written  earlier^.  In  order  to  avoid  repetition  I  will  only  give  here  a  brief 
summary.  The  interested  reader  can  find  more  complete  information  in  the  above- 
mentionned  article  and  in  the  references  therein. 

Saffman  Taylor  viscous  fingering  is  the  process  which  occurs  in  a  Hele  Shaw 
cell  when  a  fluid  of  small  viscosity  forces  a  fluid  of  large  viscosity  to  recede.  In  such  a 
situation  the  pressure  field  in  the  whole  cell  is  Laplacian  and  the  interface,  which 
moves  with  a  velocity  proportional  to  the  local  pressure  gradient,  is  unstable.  As 
the  interface  is  stabilizedat  small  scales  by  surface  tension,  the  instability  has  a 
characteristic  length-scale  called  the  capillary  length  L  which  can  be  found  by  the 
linear  stability  analysis  of  a  plane  front.  Experimentally  the  viscous  fingering  is  an 
unbounded  instability  and  creates  typical  patterns,  the  problem  being  to  understand 
their  formation. 

Amongst  the  large  variety  of  morphogenetic  instabilities,  two  are  of  a 
particular  interest  for  comparison  with  Saffman  Taylor  viscous  fingering:  the 
dendritic  crystal  growth  and  the  diffusion-limited  aggregation  (DLA). 

Dendrites  are  formed  during  the  growth  of  a  monocrystal  in  a  melt  or  in  a 
solution  in  conditions  far  from  equilibrium.  Here,  due  to  the  growth  of  the  solid  its 
surface  moves  with  a  velocity  which  is  proportionnal  to  the  gradient  of  either  the 
temperature  or  the  impurity  concentration.  Both  these  quantities  have  a  diffusive 
field,  comparable  to  a  Laplacian  field  in  the  limit  where  the  diffusion  length  is  very 
large.  The  front  is  stabilized  by  surface  tension  through  the  Gibbs-Thomson  effect  so 
that  there  is  also  a  typical  length  scale  ls  to  this  instability.  The  dendrites  form 
needles  with  a  parabolic  tip,  the  direction  of  their  growth  corresponding  to  one  of 
the  main  cristallographic  directions  of  the  cristal. 

Diffusion  limited  aggregation  is  a  model  system  introduced  by  Witten  and 
Sander  in  which  random  walkers  moving  on  a  lattice  are  emitted  far  away  from  a 
central  seed.  If  a  walker  visits  any  of  the  sites  neighbouring  this  seed  it  remains 
there.  The  process  is  repeated,  each  walker  stopping  whenever  it  visits  a  site 
neighbouring  an  already  occupied  point.  It  was  shown  that  the  probability  of  visit  of 
a  given  site  by  the  walker  obeys  a  Laplacian  law  and  that  a  region  of  the  aggregate 
grows  with  a  velocity  proportionnal  to  the  local  gradient  of  this  probability.  The 
typical  scale  lu  of  this  system  is  simply  the  mesh  size  of  the  computing  grid. 

In  very  general  terms  the  growth  morphologies  obtained  in  Saffman  Taylor 
fingering  and  their  comparison  with  those  of  the  two  other  systems  show  that  three 
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factors  are  determinant. 

-  The  geometry:  for  Laplacian  systems  the  geometry  of  the  fixed  boundaries  define 
families  of  possible  analytic  solutions  for  the  possible  shapes  of  the  interface  as  well 
as  the  selection  amongst  them. 

-  The  isotropy  or  anisotropy  of  the  system 

-  The  length  scales  involved  in  the  growth.  These  are  usually  the  instability  length 
scale  lc  and  the  total  width  of  the  moving  front.  If  these  length  scales  are  of 
comparable  order  of  magnitude  stable  curved  fronts  can  be  obtained.  If  they  are  very 
far  apart,  fractal  growth  will  occur  in  the  intermediate  range  separating  them. 

The  geometry. 

The  simplest  problem  in  viscous  fingering  is  the  growth  in  a  linear  channel.  If 
the  width  is  of  the  order  of  a  few  L  a  single  finger  moving  steadily  along  the  cell  is 
obtained  occupying  half  of  the  cell's  width.  This  is  one  of  the  archetypes  of  stable 
curved  fronts  and  its  investigation  has  a  long  history.  A  family  of  solutions  was 
found  in  the  initial  work  of  Saffman  and  Taylor  if  surface  tension  was  neglected. 
However  it  is  only  much  later  that  the  selection  of  the  actually  observed  fingers  was 
understood  as  resulting  from  the  effect  of  isotropic  surface  tension.  The  only  other 
geometries  in  which  the  understanding  of  the  stable  fronts  has  reached  the  same 
level  is  that  of  the  recently  introduced  sector-shaped  cells.  In  these  geometries  the 
observed  fingers  alsc  occupy  a  fraction  of  the  cells'  angular  width.  This  fraction  is  an 
increasing  function  of  the  cell’s  angle. 

The  isotropy  or  anisotropy  of  the  system. 

If  either  a  local  disturbance  of  the  tip  or  anisotropy  is  imposed  to  a  Saffman- 
Taylor  finger  growing  in  a  channel,  the  finger  becomes  narrower  and  more  stable.  In 
the  limit  of  a  wide  channel  it  becomes  obvious  that  the  finger's  tip  is  parabolic,  is 
followed  by  lateral  side  branching  and  has  the  general  aspect  of  a  cristalline  dendrite. 
The  introduction  of  these  disturbances  thus  lifts  the  specific  selection  due  to 
isotropic  surface  tension.  The  new  finger  is  no  longer  selected  by  its  width  but  by  the 
radius  of  curvature  at  its  tip  which  is  proportionnal  to  lc.  The  striking  aspect  of  this 
result  is  that  this  is  precisely,  due  to  their  anisotropy,  the  type  of  selection  of  the 
dendrites.  The  isotropy  or  anisotropy  of  a  system  is  thus  an  essential  ingredient  in  its 
growth  morphology  because  it  changes  the  selection  mechanism.  In  general  terms 
the  isotropic  fronts  undergo  a  selection  related  to  the  large  scale  of  the  system  while 
the  anisotropic  ones  are  selected  by  the  small  scale  of  the  system. 

The  length  scales. 

Within  the  limit  of  infinite  viscosity  contrast  and  very  rigid  glass  plates,  it  is 
possible  to  obtain  fractal  growth  in  viscous  fingering.  This  occurs  whenever  the 
length  scale  of  the  front  width  is  very  large  compared  to  lc  and  was  mostly  observed 
in  radial  geometries.  The  resulting  fractal  dimension  is  similar  to  that  of  diffusion 
limited  aggregation. 

Recent  experiments  on  very  unstable  Saffman  fingering  were  done  in  those 
geometries  in  which  the  stable  curved  fronts  are  well  known,  i.e.  the  linear  channel 
and  the  sector  shape  cells.  For  each  of  these  experiments  a  large  number  of  identical 
runs  were  performed  so  as  to  obtain  a  statistical  analysis  of  the  occupancy  of  the  cell 
by  the  patterns.  The  result  is  that  the  region  of  the  cell  with  a  mean  occupancy  larger 


than  average  has  the  shape  and  width  of  the  stable  curved  front  in  this  geometry.  It 
thus  appears  that  the  selection  mechanism  has  survived  the  finger  instability. 
Diffusion  limited  aggregation  numerical  experiments  were  done  in  the  same 
geometries.  They  form  patterns  that  are  fractal  in  a  much  larger  range  because  the 
scales  are  further  apart  from  each  other.  However  their  averaging  produces  the  same 
results.  We  can  note  that  this  averaging  method  works  both  for  isotropic  or 
anisotropic  fractal  growth,  the  average  isotropic  fractal  pattern  being  selected  by  its 
large  scale  and  the  average  anisotropic  patterns  on  its  small  scale.  A  theoretical 
interpretation  of  these  results  in  terms  of  a  modified  mean  field  theory  has  been 
given  recently. 


(1)  Y.  Couder,  Growth  Patterns:  from  stable  fronts  to  fractal  structures  in  Chaos, 
Order  and  Patterns  ,  Edited  by  R.  Artuso,  P.  Cvitanovic  and  G.  Cassati,  Plenum  New 
York  1991 
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Mechanics  in  sport 


G.  Grimvall 


Department  of  Theoretical  Physics,  Royal  Institute  of  Technology, 
S— 100  44  Stockholm,  Sweden 


Abstract 

Sports  offer  many  examples  that  illustrate  both  simple  and  complex  aspects  of 
mechanics.  This  paper  considers  examples  that  require  only  an  introductory  under¬ 
graduate  course  in  mechanics,  but  often  show  surprising  or  thoughtprovoking  results. 
Several  of  them  will  deal  with  official  rules  for  track  and  field  events,  and  the 
implication  of  those  rules  on  the  recorded  result.  Other  examples  deal  with  the 
precision  by  which  some  movements  must  be  executed,  and  the  surprising  ability  of  the 
human  brain  to  cope  with  this.  One  aim  of  the  paper  is  to  provide  the  unversity 
teacher  with  material  that  can  be  used  to  illustrate  some  principles  of  mechanics. 


1.  INTRODUCTION 

There  is  a  rich  but  scattered  literature  on  various  aspects  of  the  mechanics  of  sport. 
Frohlich  [1]  has  written  a  "resource  letter"  on  the  physics  of  sports,  with  references  to 
general  books  and  also  to  90  specialised  articles  on  particular  sports,  ranging  from 
archery  and  athletics  to  cycling  and  karate.  There  is  also  a  selection  of  reprints  on  the 
physics  of  sports  [2].  In  the  general  field  covered  by  the  present  article,  iwo  books 
stand  out  as  exceptionally  good.  One  is  "The  physics  of  ball  games"  by  Daish  [3].  It  is 
divided  into  two  parts  The  first  part  contains  a  general  treatment  of  various  ball 
games,  and  the  second  part  gives  a  mathematical  treatment  at  the  level  of  under¬ 
graduate  courses.  The  other  book  is  "Dyson's  mechanics  of  athletics",  originally 
written  by  Dyson  and  with  its  latest  edition  revised  by  Woods  and  Travers  [4],  It  gives 
a  detailed  description  of  the  mechanics  of  track  and  field  events,  with  a  large  number 
of  illustrative  drawings,  but  with  a  minimum  of  algebra. 

Much  of  what  is  published  on  the  mechanics  and  physics  of  sports  deals  with  sports 
that  are  strong  in  the  US,  e.g.,  baseball  and  golf.  In  addition  to  references  given  by 
Frohlich  [l],  one  may  mention  books  by  Watts  and  Bahil  [5],  Adair  [6],  Schrier  and 
Allman  [7],  Brody  [8],  Cochran  and  Stobbs  [9|,  Brancazio  10]  and  de  Mestre  [11],  and 
recent  papers  on  the  aerodynamics  of  sports  balls  (baseba  i,  golf,  cricket)  [12]  and  on 
features  of  a  baseball  bat  [13].  Below  we  shall  discuss  how  an  athlete's  result  may 
depend  on  parameters  that  are  restricted,  by  the  rules  of  the  sport,  to  have  certain 
values.  In  major  league  baseball,  the  coefficient  of  restitution  of  the  ball  is  required  to 
be  0.546*0.032.  Kagan  [14]  has  considered  the  mechanical  implications  of  such  a 
variation. 

The  classic  book  "Bicycling  science"  by  Whitt  and  Wilson  is  out  in  a  new  and 
revised  edition  [15].  It  contains  many  data  on  the  energy  required  in  transport,  with 
emphasis  on  bicycles. 
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Biomechanics  is  not  treated  in  the  present  paper.  This  broad  field  covers  subjects 
ranging  from  how  runners  save  energy  by  bouncing  on  their  tendons  and  ligaments  and 
on  their  running  shoes  [16J  to  how  a  videofilm  of  athlete's  motions  can  be  digitalised  to 
guide  improvements  in  tne  performance  in  sports  [17].  The  book  by  Hay  [18]  is  a 
standard  reference  on  biomechanics  and  sport. 


2.  EFFECT  OF  INCLINED  TRACKS  AND  FIELDS 


The  international  rules  for  the  athletic  throwing  events  (shot,  discus,  hammer, 
javelin)  say  that  "the  overall  downward  inclination  of  the  landing  sector,  in  the 
throwing  direction,  shall  not  exceed  1:1000".  (In  this  paper,  exact  quotes  from  the 
International  Amateur  Athletic  Federation  Handbook  1992—93  [19]  are  within 
quotation  marks.)  Similarly,  the  overall  inclination  of  the  tracks  in  the  running 
direction  shall  not  exceed  1:1000.  What  is  the  advantage  of  a  maximum  allowed 
inclination  in  throwing  and  in  100  m  sprint? 


2.1.  Throwing  events 

Let  the  implement  strike  the  field  at  a  distance  L  from  the  athlete.  With  a 
maximum  allowed  inclination,  that  point  lies  L/1000  below  the  true  horizontal  plane. 
If  the  trajectory  of  the  landing  implement  makes  an  angle  p  to  the  horizontal  plane,  it 
will  move  an  additional  distance  L/(1000  tan^)  before  it  strikes  the  field.  1.  In  hammer 
throwing  the  trajectory  follows  fairly  well  an  ideal  parabola,  with  ip  k  45°.  Then  tamp 
k  l  and  we  get  the  simple  rule  that  the  recorded  length  is  as  much  longer  as  the 
landing  point  lies  below  the  ideally  horizontal  field,  see  Figure  1.  The  world  record  is 


Figure  1.  A  throw  following  the  parabola  of  maximum  range  will  come  as  much 
farther  as  the  landing  point  lies  below  the  ideally  horizontal  plane. 


presently  86.74  m,  so  the  advantage  of  a  maximum  allowed  inclination  is  about  9  cm. 
We  remark  that  in  hammer  throwing  (and  also  in  discus  and  javelin)  the  recorded 
length  is  given  to  the  nearest  even  centimeter  below  the  measured  length,  or  the 
measured  length  if  that  is  obtained  as  a  whole  even  centimeter.  In  shot  putting,  the 
recorded  length  is  given  in  whole  centimeters. 

In  javelin  throwing  the  measurement  is  made  at  the  point  where  the  metal  head 
first  strikes  the  ground,  even  if  the  javelin  afterwards  falls  flat.  The  javelin  may  often 
seem  to  land  at  a  small  angle  p  to  the  horizontal  plane,  but  this  is  largely  an  illusion. 
Even  if  the  javelin  axis  is  almost  horizontal,  the  trajectory  of  the  center  of  gravity  (and 
hence  also  of  the  metal  head)  approaches  the  ground  more  steeply.  For  an  estimate  we 
can  take  ta nip  a  1,  as  for  hammer  throwing.  The  present  world  record  is  94.74  m,  so 
also  in  javelin  throwing  does  the  maximum  allowed  inclination  of  the  field  give  an 
advantage  of  about  9  cm. 

In  shot  putting  the  world  record  is  22.47  m,  and  hence  the  advantage  of  an  inclined 
field  is  at  most  about  2  cm.  Shot  puts  that  under  ideal  measurements  differ  by  0.9  cm 
may  sometimes  be  recorded  as  being  of  equal  length,  according  to  the  rule  cited  above. 


2.2.  100  m  sprint 

The  throwing  events  allowed  a  satisfactory  treatment  based  on  elementary 
mechanics.  The  sprint  events  are  considerably  more  difficult  to  discuss.  For  illustrative 
purposes  we  shall  treat  them  under  some  elementary,  although  possibly  dubious, 
assumptions. 

As  a  rule  of  thumb,  a  reasonably  well  trained  person  walking  in  mountain  areas  can 
climb  300  m  in  a  hour.  The  famous  Baedeker  tourist  guide  suggests  320  m/h  for 
vertical  ascent,  while  the  instruction  of  the  Austrian  army  says  300  m/h  when  carrying 
a  load.  These  numbers  correspond  to  a  power  of  about  80  W,  for  the  increase  in 
potential  energy  alone.  When  the  climb  is  of  more  limited  duration,  man  can  produce  a 
much  higher  power.  For  instance,  the  1987  record  for  running  up  the  stairs  in  the 
Empire  State  Building  in  New  York  (320  m  vertically)  is  10  min  59  s,  corresponding  to 
a  power  of  somewhat  more  than  300  W  for  the  change  in  potential  energy.  All  the 
previous  excercises  require  the  burning  of  oxygen.  In  sprint,  the  athlete  makes  direct 
use  of  chemical  energy  stored  in  the  muscles.  To  get  a  crude  estimate  we  assume  that 
he  can  develop  1.5  I  W. 

Now  consider  100  m  sprint,  and  let  there  be  an  upward  inclination  of  1:1000,  i.e.  a 
total  of  0.1  m.  Suppose  that  the  athlete  needs  an  extra  time  t\  so  that  the  energy  Pt' 
corresponds  to  the  increase  in  potential  energy  due  to  the  slope.  With  a  power  P  =  1.5 
kW,  t'  will  be  about  0.05  s.  For  all  races  up  to  and  including  10  000  m,  times  are 
recorded  to  within  0.01  s. 

If  there  is  a  downward  slope  along  the  running  direction,  it  is  not  obvious  how  the 
gain  in  potential  energy  can  be  used  to  increase  the  speed.  However,  between  two 
consecutive  steps  the  body  follows  a  trajectory  that  is  similar  to  that  described  by  the 
implements  in  throwing  events,  although  with  a  much  smaller  angle  p.  If  the  character 
of  the  leg  motion  is  not  changed  due  to  the  inclined  track,  each  step  in  free  flight  will 
be  longer  by  a  factor  [l  +  1/(1000  tam/?)].  Therefore,  with  an  inclined  track  the  steps 
need  not  be  taken  so  often  and  the  muscles  may  provide  more  energy  per  step,  leading 
to  an  increased  running  speed.  With  tan^?  =  1  and  a  power  consideration  as  above  we 
again  get  about  0.05  s  advantage.  With  a  realistic  and  much  smaller  t&n<p,  the  advan¬ 
tage  would  be  quite  significant.  However,  it  is  not  clear  that  the  power  can  be  used  as 
we  have  assumed,  and  it  is  not  clear  that  the  inclined  track  will  not  slightly  but 
detrimentally  affect  the  leg  motion. 

As  a  final  illustrating  calculation  we  again  assume  that  the  athlete  can  directly 
benefit  from  the  gain  in  potential  energy.  McFarland  [20]  considered  the  effect  of  the 
reduced  air  pressure  in  Mexico  City  and  assumed  that  the  reduced  energy  waste  on  air 
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resistance,  compared  to  the  sea  level,  could  be  used  directly  to  increase  speed.  From  his 
data  we  find  that  a  gain  in  energy  by  about  1  kJ  during  the  100  m  race  results  in  a 
time  that  is  shorter  by  0.09  s.  In  our  case  an  athlete  of  mass  75  kg  gains  73  J  in 
potential  energy  on  a  track  with  maximum  allowed  inclination,  corresponding  to  an 
advantage  of  0.007  s,  i.e.,  a  negligible  amount. 


3.  RULES  ON  THE  EQUIPMENT 


3.1.  The  hammer 

A  hammer  consists  of  a  metal  head,  a  wire  and  a  grip.  "The  head  shall  be  of  solid 
iron  or  other  metal  not  softer  than  brass  or  a  shell  of  such  metal,  filled  with  lead  or 
other  solid  material  with  a  minimum  diameter  of  110  mm.  It  must  be  completely 
spherical  in  shape."  Further,  its  "centre  of  gravity  shall  be  not  more  than  6  nun  from 
the  centre  of  the  sphere".  The  centre  of  gravity  is  normally  not  at  the  centre  of  the 
sphere,  in  spite  of  the  requirement  that  the  head  shall  be  "completely  spherical  in 
shape",  because  the  head  must  have  some  kind  of  attachment  to  the  string.  The  rules 
suggest  an  ingenious  way  to  test  whether  the  head  satisfies  the  requirement  on  the 
centre  of  gravity.  "H  must  be  possible  to  balance  the  head,  less  handle  and  grip,  on  a 
horizontal  sharpedged  circular  orifice  12  mm  in  diameter",  cf.  Figure  2.  Of  course  it  is 
then  understood  that  the  head  must  balance  for  all  of  its  orientations. 


Suggested  apparatus  for  testing  centre  of 
gravity  of  Hammer  head 


Figure  2.  Illustration  in  the  International  Amateur  Athletic  Federation  Handbook  [19j. 


3.2.  The  hammer  cage 

"All  hammer  throws  shall  be  made  from  an  enclosure  or  cage  to  ensure  the  safety  of 
spectators,  officials  and  competitors.  The  cage  should  be  designed,  manufactured  and 
maintained  so  as  to  be  capable  of  stopping  a  7.26  kg  hammer  head  moving  at  a  speed 
of  up  to  29  metres  per  second."  One  may  wonder,  why  29  m/s? 

The  well  known  expression  for  the  length  s  of  a  throw  over  a  horisontal  plane,  with 
an  elevation  angle  a  to  that  plane  and  with  the  neglect  of  air  resistance,  is 


o 

s  —  ~  sin(2a) 


(1) 
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The  maximum  range  is  v2/g,  for  o  =  45°.  The  present  world  record  in  hammer 
throwing  is  86.74  m.  Therefore  a  lower  limit  to  the  velocity  v  is  29.2  m/s,  neglecting 
the  small  correction  related  to  the  fact  that  the  launch  is  not  exactly  from  where  the 
distance  is  measured.  Since  air  resistance  is  not  entirely  negligible  (cf.  Section  4.1)  the 
actual  speed  at  which  it  might  hit  the  cage  in  a  world  record  throw  must  be  somewhat 
larger  than  29  m/s,  although  not  as  much  as  30  m/s.  Obviously  the  rules  do  not  allow 
for  a  safety  factor. 

When  throwing  the  discus  one  may  either  use  a  (slightly  modified)  cage  fcr  hammer 
throwing  or  a  specially  designed  cage.  The  latter  should  be  "capable  of  stopping  a  2  kg 
discus  moving  at  a  speed  of  up  to  25  metres  per  second".  The  present  world  record  is 
74.08  m.  Arguing  as  above  for  the  hammer,  the  discus  record  corresponds  to  a  release 
speed  of  at  least  26.9  m/s,  i.e.,  significantly  higher  than  the  design  value  of  25  m/s  for 
the  cage.  The  apparent  inconsistency  is  resolved  if  we  note  that  a  discus  "rides"  on  the 
air  and  therefore  may  travel  farther  in  a  normal  atmosphere  than  in  vacuum.  Actually, 
a  head  wind  will  often  increase  the  length  of  a  discus  throw  [4,21]. 


3.3.  Geometrical  specifications  on  the  implements 

The  hammer  is  specified  geometrically  by  the  diameter  of  the  head  (110  -  130  mm), 
by  the  entire  length  of  the  hammer  measured  from  inside  the  grip  (1175  —  1215  mm) 
and  by  the  position  of  the  centre  of  gravity  (±6  mm).  All  three  parameters  affect  the 
distance  D  from  the  grip  to  the  centre  of  gravity,  which  thus  may  vary  from  1175  —  65  — 
6  =  1104  mm  to  1215  —  55  +  6  =  1166  mm.  To  this  we  should  add  a  distance  of  almost 
1  m,  corresponding  to  the  length  of  the  arms.  However,  it  is  difficult  to  analyse  how  D 
(with  arm  length  added)  affects  the  length  of  throw.  The  release  velocity  depends  on 
both  D  and  on  the  angular  velocity  ui  of  the  string  at  the  moment  of  release.  An 
increase  in  D  is  likely  to  decrease  u.  In  fact,  if  u)  had  been  independent  of  D,  the  largest 
and  the  smallest  D  allowed  by  the  rules  would  result  in  throws  that  differ  in  length  by 
12  %,  i.e.,  about  10  m  for  a  world  class  result.  Obviously  an  argument  based  on  a 
constant  u>  is  useless.  In  this  context  it  can  also  be  remarked  that  the  rotational  motion 
of  the  hammer  does  not  have  the  athlete  as  its  centre.  If  that  were  the  case,  the 
horizontal  orntripetai  force  would  be  up  to  3  times  the  weight  of  the  athlete  and  that 
can  not  be  achieved  with  any  reasonable  frictional  force  between  the  athlete  and  the 
ground.  The  motion  is  more  like  that  of  the  Earth  and  the  Moon,  rotating  about  a 
common  centre  of  gravity. 

In  discus  and  javelin  throwing,  the  air  affects  the  trajectory  of  the  implement.  This 
is  reflected  in  the  precise  specifications  on  the  implement  geometry.  The  discus 
geometry  is  specified  by  5  lengths,  and  the  javelin  by  5  lengths  along  its  axis  and  10 
values  referring  to  various  diameters.  In  1984  new  rules  were  introduced  for  the  javelin, 
moving  the  centre  of  gravity  to  the  rear  by  4  cm  and  making  the  rear  end  somewhat 
thinner.  That  reduced  the  world  record  by  about  15  m.  In  hammer  throwing  the  effect 
of  the  air  is  difficult  to  estimate  more  precisely,  because  of  the  string  and  grip  attached 
to  the  head.  In  Section  4.1  we  crudely  estimate  it  to  reduce  the  length  of  a  throw  by 
about  2  m.  In  major  events  the  hammers  are  provided  by  the  organisors.  Since  the 
diameter  of  the  head,  according  to  the  rules,  should  lie  between  110  mm  and  130  mm, 
two  acceptable  hammers  may  differ  in  the  area  of  the  head  by  40  %,  while  still  having 
the  same  mass.  Air  resistance  is  proportional  to  the  cross  sectional  area,  so  the  two 
hammers  may  be  expected  to  give  recorded  lengths  that  differ  by  more  than  a  meter 
(cf.  Section  4.1). 

Continuing  on  an  analogous  theme,  we  note  that  in  discus  throwing  the  range  of 
supply  of  equipment  for  competition  must  be  in  the  range  2.005  kg  to  2.025  kg, 
according  to  the  rules.  It  is  difficult  to  calculate  what  a  difference  by  0.020  kg  in  mass 
does  to  the  length  of  a  throw  because  the  ability  to  exert  muscular  force  on  the 
implement  is  not  independent  of  its  opposing  inertial  force.  However,  we  are  somewhat 
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helped  by  the  fact  that  the  discus  for  women  has  a  mass  of  1  kg,  and  the  world  record 
(76,80  m)  is  longer  than  that  for  men  (7 4.08  m).  In  view  of  known  differences  between 
men  and  women  in  other  sports  requiring  strength,  it  seems  likely  that  the  mass 
difference  of  1  kg  affects  the  length  of  a  throw  by,  say,  20  m.  A  mass  difference  of  0.020 
kg  would  then  correspond  to  almost  half  a  meter  in  the  recorded  result,  if  we  can 
extrapolate  linearly  from  the  1  kg  through  the  2  kg  discus. 


4.  AIR  RESISTANCE  AND  WIND  VELOCITY 


Air  resistance  affects  the  results  not  only  in  the  throwing  events  but  in  many  other 
sports  as  well.  One  may  think  of  the  world  record  by  Beamon  in  the  long  jump  at  the 
Olympic  games  in  1968  in  Mexcico  City,  that  is  often  given  as  an  example  although  we 
shall  see  below  that  the  effect  was  quite  small.  In  golf  there  is  not  only  a  retarding  air 
resistance.  The  Magnus  effect  on  a  spinning  ball  may  help  to  carry  the  ball  typically  75 
m  farther  than  in  an  air  and  50  m  farther  than  in  vacuum  [9].  In  this  Section  we  shall 
compare  a  dimensionless  "figure— of— merit"  for  the  ordinary  air  resistance  on  various 
balls  and  then  discuss  wind  speed  versus  air  pressure  for  some  sports. 


4.1.  A  figure-of-merit  for  air  resistance 

The  standard  expression  for  the  air  resistance  F  acting  on  a  body  with  a  smooth 
surface  moving  with  a  speed  v  in  air  of  density  p  is 

F  =  \  CdA/>v2  (2) 

Here  A  is  the  cross  sectional  area  and  Cd(Re)  is  a  drag  coefficient  that  varies  with  the 
Reynolds  number  Re.  For  a  sphere  with  radius  r, 


(3) 


where  u  is  the  kinematic  viscosity  of  air.  For  a  sphere,  in  air  at  normal  pressure  and 
temperature,  Re  =  12.8-  104rv,  if  v  is  expressed  in  m/s  and  r  in  m.  For  103  <  Re  <  Rec 
*  2- 105,  and  for  a  sphere,  we  may  take  Cd  =  0.5.  When  Re  >  Rec,  Cd(Re)  decreases 
significantly,  i.e.,  the  air  resistance  is  lowered. 

In  a  uniformly  retarded  linear  motion,  the  distances  covered  in  a  time  t,  with  initial 
velocity  v0  and  retardation  a,  is 


s  =  v0t-^at2  (4) 

If  a  is  small  it  is  useful  to  introduce  a  dimensionless  quantity  £  that  measures  how  the 
retardation  affects  s.  We  define 


e  = 


at2  _  at_ 
2v0t  2v0 


(5) 


which  yields 
s  =  v0t(l  -  0 


(6) 


Similarly,  the  velocity  v  after  a  time  t  is 
v  =  v0  -  at  =  v0(l  -  2£) 
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(7) 

Let  m  be  the  mass  of  the  moving  body,  and  take  a  =  F/m,  with  F  from  Eq.  (2). 
Further,  let  t  =  t0  be  the  time  of  interest  (e.g.,  the  time  to  complete  the  trajectory  of  a 
ball).  Since  t0  is  usually  not  directly  available,  we  replace  it  by  4/v0  where  4  is  a 
characteristic  length,  e.g.,  the  length  of  a  throw.  Then 

«  =  jcdA^(4)(5i_)  =  l(Ci4£4)  (8) 

In  Table  1  are  summarised  characteristic  parameters  and  the  resulting  £  for  some 
sports.  For  all  cases  we  have  taken  C<j  =  0.5.  When  the  trajectory  of  the  object  is  not 
straight,  the  relation  between  t0,  v0  and  4  is  complex.  We  have  taken  v0  to  be  the 
speed  of  release  etc.,  without  resolving  it  into  vector  components.  The  distance  4  is 
chosen  to  be  the  straight  distance  between  the  end  points  of  the  trajectory  of  interest. 
Variations  in  £  by  50  %  are  not  significant. 


Table  1 

Parameters  governing  the  air  resistance 


Sport 

Area 

A  (cm2) 

Mass 

m  (kg) 

Speed 

v0  (m/'s) 

Length 

4  (m) 

£ 

Re 

Shot 

95 

7.26 

15 

22 

0.005 

100  000 

Hammer 

95 

7.26 

29 

90 

0.02 

200  000 

Golf 

14 

0.046 

70 

200 

1 

190  000 

Tennis 

32 

0.058 

60 

20 

0.2 

240  000 

Table  tennis 

11 

0.0025 

25 

7 

0.5 

60  000 

Soccer 

380 

0.43 

30 

30 

0.5 

420  000 

Long  jump 

5000 

70 

10 

8 

0.01 

500  000 

The  values  in  Table  1  for  the  ball  games  do  not  include  any  effect  of  a  spinning  ball. 
The  density  of  air  is  taken  to  be  that  at  sea  level.  The  velocities  are  those  at  the  start 
of  the  trajectory,  for  world  class  performances. 

We  see  that  for  soccer  Re  >  Rec  so  Cd  is  significantly  smaller  than  the  value  0.5 
assumed  in  Table  1.  This  reduces  £  to  about  0.3.  A  kick  that  would  carry  the  soccer 
ball  about  40  m  if  the  standard  value  Cd  =  0.5  is  used,  actually  travels  about  60  m  [3]. 
In  golf  Re  <  Rec  according  to  Table  1,  but  our  estimate  of  Rec  refers  to  a  smooth 
surface  of  the  ball.  Golf  balls  have  dimpled  surfaces  which  considerably  reduces  the  air 
resistance.  For  long  jump,  Re  estimated  from  a  sphere  may  be  less  relevant. 

The  approach  in  the  beginning  of  this  Section  assumed  that  £  <<  1.  It  is  obvious 
that  the  ball  games  cannot  be  treated  by  considering  air  resistance  as  a  small 
perturbation. 


4.2.  Wind  speed  versus  air  pressure 

The  expression  (2)  for  the  air  resistance  contains  a  velocity  v  which  of  course  is  to 
be  taken  relative  to  the  air.  A  tail  wind  with  a  speed  v’  would  reduce  the  retarding 
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force  F,  This  is  insignificant  when  £  <<  1.  In  fact,  £  does  not  contain  v  —  v1  because 
the  tail  wind  gives  a  second-order  effect  when  £  <<  1. 

In  the  track  and  field  events  long  jump,  triple  jump  and  running  up  to  and  including 
200  m,  the  international  rules  state  that  "if  the  wind  velocity  measured  in  the  direction 
of  jumping"  (etc.)  "averages  more  than  2  metres  per  second,  the  record  will  not  be 
accepted".  The  wind  speed  is  measured  and  averaged  automatically,  during  a  certain 
time  that  varies  with  the  event. 

The  air  resistance  depends  not  only  on  the  speed  through  the  air  but  also  on  the  air 
density,  i.e.,  indirectly  on  the  altitude.  The  air  pressure  at  sea  level  is  about  1010  hPa. 
In  Mexico  City,  located  at  an  altitude  of  2200  m,  the  pressure  is  about  780  hPa.  From 
the  gas  law,  pV  =  RT,  we  get  that  the  density  p  is  proportional  to  the  pressure  p,  at 
equal  temperature?  T.  Hence  p  is  23  %  lower  in  Mexico  City  than  in,  e.g.,  Rome.  Now 
compare  an  event  in  still  air  in  Mexico  City  with  an  event  with  a  tail  wind  in  Rome. 
The  air  resistance  in  the  two  cases  is  the  same  if  there  is  a  tail  wind  v'  in  Rome  so  that 
(v-v')2/v-  =  780/1010.  When  v  «  11  m/s,  as  is  approximately  the  case  in  100  m  sprint, 
v1  =  1.3  m/s.  McFarland  [20]  has  considered  how  much  the  lower  air  resistance  in 
Mexico  City  affects  the  time  for  100  m  sprint.  He  allows  for  the  varying  speed  during 
the  run,  and  assumes  that  the  kinetic  energy  (i.e.,  the  speed)  increase  corresponds  to 
the  lower  energy  loss  due  to  air  resistance  at  low  pressure.  This  model  calculation 
suggests  that  running  in  Mexico  City  ,ather  than  at  sea  level,  and  in  both  cases  in  still 
air,  gives  an  advantage  of  about  0.09  s.  Based  on  this  result  we  would  expect  that  a 
maximum  tail  wind  of  2  m/s  gives  an  advantage  of  about  0.13  s. 

The  combined  effect  of  the  low  air  density  in  Mexico  City  and  a  maximum  allowed 
tail  wind  of  2.0  m/s  there,  would  be  equivalent  to  competing  at  a  place  where  the  air 
pressure  is  reduced  to  52  %  of  that  at  sea  level.  This  combined  effect  may  lower  the 
time  by  0.2  s  in  100  sprint. 

We  finally  note  that  a  change  in  temperature  affects  the  air  density.  An  increase  by 
20  °C,  which  is  not  unreasonable  if  we  compare,  e.g.,  Stockholm  and  Rome,  corre¬ 
sponds  to  a  difference  in  p  by  6  %  when  the  pressure  is  held  constant.  That  means  an 
advantage  by  0.02  s  in  100  m  sprint,  but  only  about  1  cm  in  long  jump.  In  hammer 
throwing  its  effect  is  larger  than  about  10  cm,  i.e.,  larger  than  the  effect  of  a  maximum 
allowed  inclination  of  the  field. 


5.  RECORDS  AND  GRAVITATION 


The  acceleration  of  gravity,  g,  varies  from  place  to  place.  One  might  first  think  that 
high  altitudes  would  reduce  g  significantly,  but  that  is  a  small  effect  compared  to  the 
variation  of  g  with  the  latitude  <p.  It  is  a  good  approximation  to  take 

g  =  9.832  -  0.052  cos2yJ  (m/s2)  (9) 

The  dependence  on  <p  has  two  causes,  one  being  a  "direct"  centrifugal  effect  due  to  the 
rotation  of  the  earth  and  the  other  coming  indirectly  from  the  fact  that  the  rotation 
also  changes  the  shape  of  the  earth  to  a  somewhat  flattened  sphere.  The  actual 
acceleration  of  gravity  may  differ  slightly  from  the  result  of  Eq.  (9)  due  tc  local 
variations  in  the  mass  disiribution  on  the  Earth,  but  for  our  purposes  Eq.  (9)  is 
sufficient.  From  it  we  get  the  results  of  "  able  2. 


Table  2 

The  acceleration  of  gravity,  g  (m/s2),  in  some  cities. 

Stockholm  9.818  Rio  de  Janeiro  9.788 

Berlin  9.813  Sho  Paulo  9.789 

Mexico  City  9.780  Melbourne  9.800 


Now  consider  a  standard  parabolic  trajectory,  for  which  s  =  v2sin(2o)/g.  With  a 
given  launch  speed  v  and  elevation  angle  a,  a  change  in  g  changes  the  distance  s  by  As. 
From  Eq.  (9)  we  can  write 

As/s  =  |Ag/gj  =  0.0053  cos2<p  (10) 

One  of  the  classic  world  records  is  that  of  the  Japanese  Tajima,  whose  16.00  m  in 
triple  jump  at  the  Olympic  games  in  Berlin  in  1936  was  not  matched  until  1950,  when 
Ferreira  da  Silva  from  Brazil  also  reached  16.00  m,  in  Sao  Paulo.  The  following  year  he 
jumped  16.01  m  in  Rio  de  Janeiro.  However,  we  see  from  Table  2  that  the  lower 
acceleration  of  gravity  in  S2o  Paulo  and  Rio  de  Janeiro,  compared  to  Berlin, 
corresponds  to  an  advantage  of  about  4  cm  for  Ferreira  da  Silva.  This  fact  started  a 
public  discussion  at  the  time,  whether  not  Tajima's  result  was  still  the  best.  The  same 
argument,  when  applied  to  hammer  throwing,  gives  an  advantage  to  competitors  in 
Mexico  City,  over  those  in  say  Stockholm,  by  34  cm.  This  is  from  the  difference  in 
gravity  alone.  About  an  equally  large  advantage  may  come  from  the  lower  air  pressure, 
and  hence  reduced  air  resistance  on  the  hammer,  in  Mexico  City,  cf.  Section  4.2. 

McFarland  [20]  has  simulated  numerically  a  long  jump,  a  triple  jump  and  a  high 
jump  in  Mexico  City  and  in  Moscow,  allowing  for  variations  both  in  gravity  and  in  air 
resistance.  He  concludes  that  the  famous  world  record  by  Beamon  got  an  extra  length 
by  about  2  cm  from  the  decreased  gravity,  3  cm  from  the  reduced  air  pressure  and 
another  3  cm  from  the  fact  that  there  was  a  maximum  allowed  tail  wind  of  2  m/s. 
These  results  of  his  detailed  calculations  are  in  all  essentials  in  agreement  with  the 
results  we  arrive  at  using  the  simplified  arguments  in  this  paper. 

A  word  of  caution  is  appropriate  when  one  discusses  the  effect  of  reduced  gravity  on 
sports  records.  In  some  textbooks  one  is  asked  to  estimate  the  world  record  in  high 
jump,  if  competitions  could  be  held  on  the  Moon  where  the  gravity  is  only  1/6  of  the 
value  on  the  Earth.  Obviously  the  record  would  not  be  6  times  higher  on  the  Moon 
than  on  the  Earth,  because  what  matters  is  how  much  the  centre  of  gravity  is  raised  in 
the  jump.  For  a  world  class  male  athlete  on  the  Earth,  it  is  raised  from  about  1.1m  to 
about  2.4  m  when  the  bar  is  passed.  About  8  or  9  meters  would  then  be  a  reasonable 
answer  to  the  Moon  problem.  However  the  altered  gravity  certainly  also  affects  the 
athlete's  approach  and  take-off.  that  will  be  less  forceful  on  the  Moon. 

We  now  return  to  throwing  events.  The  inertial  forces  are  much  more  important 
than  gravity  forces  during  the  athlete's  throw.  Consider,  as  an  illustration,  shot 
putting.  For  our  purpose  a  crude  model  is  sufficient.  We  assume  that  during  the  final 
stage  of  the  put,  the  launch  velocity  v  is  reached  in  a  uniformly  accelerated  motion 
over  a  distance  h.  That  acceleration  requires  a  force  Fj  =  mv2/(2h),  or  Fj  —  400  N  if  rn 
=  7  kg,  h  =  2  m  and  v  =  15  m/s.  During  the  throw  the  athlete  also  raises  the  shot  and 
so  has  to  overcome  the  gravity  force.  The  difference  in  g  between  Stockholm  and 
Mexico  City  corresponds  to  a  change  in  the  gravity  force  on  the  shot  by  0.2  N,  or 
1/2000  of  Fi.  The  arm  and  other  parts  of  the  athlete's  body  must  also  be  accelerated 
and  raised  in  the  gravity  field,  but  that  will  lead  to  approximately  the  same  ratio 
between  altered  gravity  force  and  inertial  forces.  If  the  total  force  applied  by  the 
athlete  is  constant  (i.e.,  independent  of  the  small  gravity  changes)  the  launch  velocity 
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would  increase  by  1/4000  and  the  length  of  the  throw  by  1/2000  in  our  example.  That 
means  1  cm  in  a  20  m  shot  put,  to  be  compared  with  about  6  cm  from  the  effect  of  the 
reduced  gravity  force  during  the  free  flight  of  the  implement. 


6.  THROWING  AT  A  TARGET  -  THE  PRECISION  OF  RELEASE 


In  some  sports  an  implement  is  released  towards  a  mark  or  another  implement  at 
some  distance,  and  the  aim  is  to  come  as  close  as  possible  to  the  target.  Here  we  shall 
first  consider  sports  akin  to  boule,  and  then  curling.  Finally,  references  are  given  to 
work  on  tennis,  golf  and  basketball. 


6.1.  Varpa ,  boule  and  related  sports 

A  vertical  stick  is  placed  20  m  from  the  competitors,  who  are  throwing  flat  stones 
(real  stones,  or  cast  metal  discs)  trying  to  land  as  near  the  stick  as  possible.  This 
particular  version  of  a  universally  spread  game  is  called  varpa  and  has  its  roots  on  the 
island  of  Gotland,  in  the  Baltic  Sea.  Boule  is  a  similar  game,  with  metal  balls  thrown 
at  another  ball  at  a  distance  of  about  15  m.  In  one  version  of  the  varpa  one  measures 
the  distance  between  the  stick  and  the  nearest  part  of  the  stone,  a  champion  has  an 
accumulated  distance  to  the  stick  of  about  400  cm  after  a  completed  competition  of  36 
throws,  i.e.,  about  10  cm  per  throw. 

In  order  to  come  close  to  the  target,  the  direction  sideways  has  to  be  correct.  We 
shall  neglect  that  and  focus  on  the  problem  of  landing  the  stone  at  the  correct  distance 
from  the  launch  point.  Suppose  that  a  pointlike  mass  is  thrown  so  as  to  land  within  the 
distances  d  ±  b.  We  neglect  air  resistance  and  use  the  result  for  the  parabolic 
trajectory,  i.e.,  we  require  that 

d-b<  ^  sin(2a)  <  d  +  b  (11) 

O 

When  d  =  20  m  and  b  =  10  cm,  the  inequality  is  fulfilled  for  all  elevation  angles  a  and 
launch  speeds  v  that  lie  within  the  shaded  area  in  Figure  3.  The  allowed  interval  in 
launch  speed,  for  an  angle  a  =  45°,  is  (14.00±0.04)  m/s.  It  is  remarkable  how  well  the 
human  brain  is  able  to  achieve  such  a  fantastic  precision.  But  it  is  not  necessary  to 
exemplify  with  a  champion.  Even  an  unexperienced  person  can  come  within  1  m  of  the 
stick  20  m  away,  after  a  short  practice.  That  requires  a  launch  velocity  (14.00±0.35) 
m/s,  i.e.,  a  variation  by  less  than  2.5  %.  For  completeness  it  should  be  noted  that  the 
distance  to  the  stick  is  measured  from  the  closest  part  of  the  stone.  Since  the  stone  has 
a  certain  size  the  requirements  on  v  and  a  are  somewhat  less  stringent  than  obtained 
in  our  model  of  a  pointlike  particle. 


6.2.  Curling 

In  curling  a  stone  slides  over  the  ice  towards  a  target  about  36  m  away.  We  analyse 
this  in  an  idealised  model  where  a  mass  m  moves  over  a  horisontal  surface  with  a 
constant  coefficent  of  friction  f.  The  speed  of  delivery  is  v.  After  a  time  t  the  mass 
stops,  at  a  distance  s.  Thus 


v  —  at  =  0 
s  =  vt  -  at2/2 


(12) 

(13) 
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Figure  3.  The  "window  of  acceptance"  for  the  launch  velocity  v  and  the  elevation 
angle  a  in  the  sport  varpa,  where  a  flat  stone  is  thrown  towards  a  stick  at  a  distance  of 
20  m  and  the  center  of  the  stone  lands  within  10  cm  of  the  stick. 


The  retardation  is  a  =  gf,  where  g  is  the  acceleration  of  gravity.  Hence 

f  =  2s/(gt2)  (14) 

The  typical  time  for  the  stone  to  slide  to  the  target  is  t  =  22  s.  With  s  =  36  m,  Eq. 
(14)  gives  f  =  0.015.  The  coefficient  of  friction  between  the  curling  stone  and  the  ice  is 
very  low! 

In  analogy  to  the  discussion  above  for  varpa  we  now  ask  what  is  the  allowed  interval 
for  the  velocity  of  delivery  if  the  stone  is  to  stop  within  a  certain  distance  interval. 
Eliminating  t  in  the  equations  above  gives 

v  =  V2gis  (15) 

With,  say,  s  =  (36t0.5)  m  and  f  —  0.015  one  has  v  =  (3.25±0.03)  m/s.  Several  factors 
complicate  the  situation  in  the  real  world  of  curling.  The  ice  is  not  flat  but  has  bubbles 
from  being  sprinkled  with  water.  The  underside  of  the  stone  is  not  flat,  and  it  is  in 
contact  with  the  ice  only  along  its  rim.  Further,  the  stone  is  given  a  slight  rotation, 
that  leads  to  a  curl  in  the  path  and  to  complicated  frictional  forces.  During  a 
competition  the  coefficient  of  friction  may  change,  and  different  rinks  can  have 
frictions  that  vary  by  20  %  or  more.  On  the  other  hand  the  rules  allow  the  sweeping  of 
the  ice  with  brooms,  in  front  of  the  stone  while  it  slides  forward.  That  may  affect  the 
sliding  distance  by  several  meters.  However,  we  should  note  that  an  increase  in  s  by  3 
m,  in  our  model  example  only  requires  that  v  increases  from  3.25  m/s  to  3.39  m/s,  so  v 
still  has  to  be  controlled  within  a  few  per  cent. 
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6.3.  Other  sports  —  tennis,  golf  and  basketball 

We  end  by  giving  references  to  some  works  on  the  required  precision  in  other  sports. 
With  computer  techniques  one  may  realistically  calculate  the  trajectory  of  a  tennis  ball 
or  a  golf  ball,  even  when  it  is  spinning  in  air.  In  tennis  one  can  define  an  "acceptance 
window"  in  speed  and  velocity  of  a  returned  ball,  so  that  it  clears  the  net  and  hits 
inside  the  court.  An  advanced  popular  book  has  been  written  by  Brody  [8j.  He  has  also 
written  some  more  technical  articles  on  tennis  (22,231.  Similar  work,  out.  relating  to 
golf,  has  been  published  in  a  popular  and  lavishly  illustrated  book  by  Cochran  and 
Stobbs  [9j.  Holmes  [24]  has  modelled  golf  putting. 

Shots  in  basketball  are  particularly  complex  when  performed  while  the  athlete  is  in 
the  air,  since  the  launch  velocity  is  affected  by  the  translational  and  rotational  motion 
of  the  body,  as  discussed  by  Brancazio  [10,25].  Townend  [26]  has  considered  the  free 
(ie.,  penalty)  shot  in  basketball,  with  particular  emphasis  on  the  launch  angle,  the 
speed  and  the  height  of  the  launch  point. 


7.  CONCLUDING  REMARKS 


In  the  first  part  of  this  paper  we  considered  various  factors  that  the  athlete  has  no 
control  over,  but  which  will  affect  the  recorded  result.  Table  3  summarises  some  of  the 
findings.  The  columns  refer  to  (i)  the  efffect  of  a  maximum  slope  of  1:1000  of  the  field 
or  track;  (ii)  variations  in  the  acceleration  of  gravity  between  a  place  close  to  the 
equator  (e.g.,  Mexico  City,  SSo  Paulo)  and  a  place  further  to  the  poles  (e.g., 
Stockholm);  (iii)  reduced  air  pressure  at  high  altitudes  (e.g.,  Mexico  City)  compared  to 
sea  level;  and  (iv)  maximum  allowed  tail  wind  of  2  m/s.  The  numbers  given  are  of 
course  only  approximate  and  of  varying  uncertainty,  as  discussed  above.  Each  entry 
refers  to  the  size  of  the  possible  improvement  due  to  the  factor  under  consideration. 
The  units  in  Table  3  are  meter  (hammer,  long  jump)  and  second  (100  m  sprint).  It  is 
seen  that  the  combined  effect  of  the  most  favourable  inclination,  gravity,  air  pressure 
and  tail  wind  implies  a  total  advantage  of  about  1  %  of  the  recorded  result  in  hammer 
throwing  and  long  jump  and  about  2  %  in  100  m  sprint. 


Table  3 

Summary  of  advantages  related  to  various  factors,  given  in  meters  (hammer,  long 
jump)  and  in  seconds  (100  m  sprint) 


Sport 

Slope 

Gravity 

Air 

Wind 

Hammer 

0.1 

0.3 

0.5 

0.2 

Long  jump 

0.01 

0.02 

0.03 

0.03 

100  m  sprint 

<0.02? 

<0.01 

0.09 

0.13 

In  the  last  part  of  this  paper  we  considered  the  precision  by  which  an  athlete  has  to 
release  an  implement  in  order  to  succesfully  reach  a  certain  target.  Often  the  speed 
given  to  the  implement  must  be  controlled  to  within  1  %,  which  is  an  impressive 
mdication  of  how  well  the  human  brain  can  coordinate  movements. 
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Abstract 

Theory  of  vortex  sound  in  the  aeroacoustics  can  describe  characteristic  features  of 
the  waves  radiated  by  localized  vortex  motions  at  low  Mach  numbers.  Some  evidences 
and  illustrative  examples  are  presented  from  our  studies  to  show  remarkable  agreement 
between  observed  and  computed  wave  profiles  emitted  by  the  vortex  motions  with  or 
without  presence  of  solid  body.  Among  the  cases  to  be  discussed  here  are  (i)  head- 
on  collision  or  oblique  collision  of  two  vortex  rings,  (ii)  vortex- cylinder  interaction,  fiii) 
vortex-edge  interaction.  Corresponding  to  each  of  the  three  cases,  the  radiation  patterns 
are  characterized  by  the  quadrupole,  dipole  and  cardioid  type,  respectively.  Monopolar 
(isotropic)  component  detected  in  the  experiment  is  likely  due  to  the  viscous  dissipation 
of  kinetic -energy.  In  the  oblique  collision  of  two  vortex  rings,  reconnection  of  vortex¬ 
lines  occurs,  and  causes  a  violent  motion  of  the  vorticity,  resulting  in  excitation  of  a 
characteristic  type  of  acoustic  wave. 


1.  INTRODUCTION 

In  the  theory  of  vortex  sound  at  low  Mach  number  and  high  Reynolds  number  [1-8], 
the  source  flow  is  characterized  by  a  localized  vortex  motion  scaled  on  a  length  l  which 
is  regarded  as  the  vortex  size  too.  This  flow  field  is  surrounded  by  outer  wave  field 
scaled  on  the  length  A  «  cr  =  0(//M),  where  r  =  l/u ,  u  being  a  typical  velocity  of 
the  flow  field.  The  sound  speed  and  density  in  the  undisturbed  uniform  medium  are 
denoted  by  c  and  p0,  respectively.  Typical  Mach  number  M  is  assumed  to  be  much 
less  than  unity:  M  —  u/c  -C  1.  This  enables  separate  analysis  of  the  two  fields:  inner 
flow  and  outer  wave  regions,  because  of  the  compactness  of  the  source  flow,  1<  A.  The 
theory  of  vortex  sound  is  one  of  the  most  successful  parts  in  the  theory  of  aerodynamic 
sound  [9-12]. 

In  our  investigation  of  the  vortex  sound  in  the  past  decade,  evidences  showing  the  va¬ 
lidity  of  the  theory  are  accumulating.  Our  experimental  and  theoretical  studies  include 
three  typical  cases.  The  first  one  is  the  sound  generated  by  two  interacting  vortices  in 
free  space  in  the  absence  of  external  body:  i.e.  (»)  head-on  collision  and  (ii)  oblique 
collision  of  two  vortex  rings.  In  the  first  stage  of  the  collision,  the  vortex  interaction  is 
almost  inviscid.  This  is  followed  by  the  second  viscous  stage  :  cancellation  of  opposite 
signs  of  vorticity  at  the  time  of  collision,  reconnection  of  vortex-lines,  or  turbulent  dis¬ 
integration.  Acoustic  waves  generated  by  these  motions  were  detected  experimentally 
and  investigated  in  detail.  The  second  case  is  the  sound  wave  generated  by  a  vortex 
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interacting  with  a  solid  body  of  length  scale  l.  Presence  of  an  external  body  in  the 
vicinity  of  an  eddy  causes  emission  of  waves  of  the  dipole  type  [13],  whereas  the  waves 
in  the  former  case  are  characterized  as  the  quadrupole.  The  third  case  considered  below 
is  the  wave  generation  by  a  vortex  ring  moving  near  an  edge  plate  whose  size  is  larger 
than  the  scale  A.  Remarkable  feature  of  the  wave  field  are  the  effects  of  the  edge  and 
non-compactness  of  the  plate,  giving  rise  to  the  scattering  of  waves  by  the  edge  plate. 
Here  the  temporal  wave  profile  depends  on  the  time  history  of  the  vortex  motion  [29]. 

In  the  present  paper,  asymptotic  representation  of  the  velocity  potential  of  the  inner 
flow  is  sought  first,  and  then  the  solution  of  the  outer  wave  equation  is  determined  so 
as  to  be  matched  to  the  inner  solution.  This  method  is  especially  successful  to  the  first 
case  of  the  vortex  sound  in  free  space,  and  also  applicable  to  the  second  case  of  compact 
body.  The  third  case  must  be  treated  differently  because  of  the  effect  of  scattering.  Main 
interest  here  is  comparison  of  the  wave  profiles  thus  determined  computationally  with 
the  corresponding  profiles  detemined  experimentally.  The  vortex  motions  considered 
are  three  dimensional.  Although  two  dimensional  problems  of  sound  generation  by 
rectilinear  vortices  were  studied  in  the  early  times  of  development  ([2],  see  [10]  for 
reviews),  experimental  observations  and  validation  are  rare. 


2.  MATHEMATICAL  FORMULATION 

Basic  equation  of  the  aerodynamic  sound  is  given  by  the  Lighthill’s  equation  [9j: 

(1) 

where  T{j  =  pv,Vj  +  (p  -  c2p)6;j  -  r%J  , 

p  is  the  pressure,  p  the  density,  Vi  the  i-th  component  of  velocity  and  the  viscous 
stress  tensor.  Summation  convention  is  assumed  on  the  right  hand  side  (rhs)  of  (1)  and 
henceforth. 

In  the  inner  region  scaled  by  the  length  l  and  time  r,  the  ratio  of  the  two  terms 
on  the  left  hand  side  is  estimated  as  |pM|/]c2V2p|  =  0(/2/c2r2)  =  0(M2).  In  the  first 
approximation  neglecting  terms  of  0(M2)  and  hence  the  term  pti,  the  equation  (1) 
reduces  to  the  Navier-Stokes  equation  for  an  incompressible  fluid,  governing  solenoidal 
vortex  motion  treated  as  a  source  flow. 

The  solenoidal  velocity  field  v{x,  t )  can  be  represented  in  terms  of  a  vector  potential 
A(x,t):  v  —  V  x  A.  Given  the  vorticity  u>(x,t),  the  vector  potential  is  expressed  as 

<2> 

( e.g .  see  [14]).  At  a  large  distance  r  —  jxj  away  from  the  localized  vorticity  region  having 
either  a  compact  support  of  w(y,t)  or  such  a  property  that  jo;)  decays  expoentially  as 
r  — >  oo,  the  factor  1/ ]ar  —  yj  can  be  expanded  in  a  Taylor  series  which  is  convergent  for 
|y|/M  <  1,  and  the  velocity  field  v{x,t)  tends  to  be  represented  asymptotically  by  a 
velocity  potential  $.  Asymptotic  expansion  of  the  potential  4>(ar,t)  which  is  obtained 
after  some  nontrivial  calculation  for  rjl  — >  oo  is  given  in  the  form  [15], 

$(x,t)  ~  $0(aM)  +  Qidi-  +  Qijdidj-  +  Qi]k  didjdk-  +  0(r~5)  , 

r  r  r 


Pu  ~  c2V2p  =  s-s-  Tij 


_ _ T.. 

dxidxj  ‘J 


(3) 
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where  di  =  d/dxi ,  and 


Qi(t)  =  ~  J(V  x  w)id3y  , 

QiA*)  =  -  /(»  x  dV  . 

Q«*(0  =  3^:  f(y  x  w)i»j»*dV  - 


(4) 

(5) 

(6) 


The  vector  a  =  (*<)  is  used  for  the  point  of  observation  and  y  =  ( y; )  for  the  point  of 
integration  (the  souce  point).  It  is  almost  trivial  to  show  the  following  properties, 

<?«(<)=  0,  Qu t(<)=0,  Qm(t)  =  0  (7) 

by  the  orthogonality  of  the  two  vectors  y  and  y  x  <u.  The  resultant  impulse  of  the  vortex 
system  is  defined  by  47rQi,  which  is  an  invariant  of  motion  for  unbounded  fluid  wihtout 
external  body. 

The  first  term  $0(x,t)  represents  the  effects  which  cannot  be  represented  by  the 
expression  (2)  (e.g.  fluid  compressibility  or  presence  of  a  solid  body  in  the  inner  region). 
The  compressibility  of  the  source  flow  (  |  div  uj  being  0(.M2))  is  assumed  to  be  localized, 
apart  from  the  generated  acoustic  wave  which  is  of  larger  scale  A  l  and  treated  in 
the  next  stage.  From  the  theory  of  irrotational  motion  with  localized  rate  of  expansion 
A  =  div  v,  the  first  term  $0  is  given  by  the  following  asymptotic  expansion  [14], 

$0  =  i*o(0-  +  jfc(0  9i-  +  RiAt)  Mi-  +  Riik(t)  MA-  +  o(r~5)  (8) 

r  r  r  r 

where  R0,  Ri,  Rij,  •  ■  •  are  all  functions  of  time  t  and  determined  by  the  distribution  of 
A(«c,  t ).  Presence  of  a  solid  body  in  the  vicinity  of  the  vortex  motion  can  be  represented 
by  an  additional  velocity  potential  whose  asymptotic  expansion  is  again  of  the  form  (8) 
[14].  In  the  case  of  solid  body,  the  first  monopole  term  must  vanish  by  the  condition  of 
no  net  outflow  over  the  body  surface.  Thus  both  effects  of  compressibility  and  presence 
of  solid  body  are  represented  by  the  expansion  (8). 

Corresponding  pressure  field  at  large  distances  from  the  vortex  motion  is  given  by 
the  linear  approximation,  p^(x,i)  =  —p0$t,  namely 


P(,)(x,  t )  =  -po-Po  -  -  poPi  0i-  -  p0Pij  didj~  -  p0Pnk  didjdk -  +  0(r  5)  (9) 

t  v  r  v 

where  the  over-dot  denotes  a  time  differentiation,  and  Po(0  =  Rq  ,  P(0  —  Qi  + 
Ri ,  Pij(t)  —  Qij  +  Rij ,  Pijk(t)  —  Qijk  +  Rijk  >  •  *  •  • 

In  the  outer  region,  the  rhs  of  (1)  becomes  vanishingly  small  and  the  pressure  is  gov¬ 
erned  by  the  wave  equation  :  pu  —  c2V2p  =  0,  because  of  the  (assumed)  adiabatic  relation 
dp  —  c2dp.  The  wave  pressure  pW  matching  to  the  inner  solution  (9)  asymptotically  as 
|  x  |  /A  — ♦  0  is  given  by  the  multi-pole  expansion, 


=  -p0  —  —  -  p0di 


PAtr) 


Po  did] 


PAtr) 


-  podidjdk 


Pijki.tr ) 


+ 


(10) 


where  tr  —  t  —  rjc  is  the  retarded  time.  This  is  verified  as  follows. 
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Obviously,  each  term  of  (10)  satisfies  the  wave  equation.  Matching  of  the  two  solu¬ 
tions  p M  and  pW  is  carried  out  in  an  intermediate  region.  In  terms  of  the  inner  variable 
defined  by  the  normalization  =  Z{/1  (and  r  =  r/l),  the  outer  variable  i*  =  z</A  and 
intermediate  variable  &  are  given  by 

xi  =  Mxi  =  ,  &  =  Maxi  =  ( Ma/l)  xi  (11) 

where  a  is  a  parameter  in  the  range  0  <  a  <  1.  Normalized  retarded  time  is  t,  =  tr/r  = 
i  —  f  =  i  —  Mx~a£,  where  t  —  t/r,  f  =  r/A,  £  —  Ma~1r  =  Maf.  Using  we  have 
f  =  M~a£  and  f  —  Thus  in  the  limit  M  — >  0  with  keeping  £  and  &  fixed  (the 

intermediate  limit),  we  obtain  r  —*  o o  and  f  — »  0.  As  an  example  we  consider  matching 
of  the  second  order  term.  The  third  term  of  the  inner  pressure  (9)  is  written  as 
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whereas  the  third  term  of  the  outer  wave  pressure  (10)  is 
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where  the  superscripts  (i)  and  (w)  are  used  to  distinguish  the  functions  in  the  two 
regions.  It  is  found  that  both  expressions  have  the  same  order  of  magnitude  M3"  and 
we  have  in  the  intermediate  limit  as  M  — *  0.  Crow  [4]  shows  that  the 

second  term  of  the  order  0(Ma+2)  in  the  parenthesis  [  ]  on  the  rhs  of  (12)  can  be 
matched  to  the  term  in  the  next  approximation  (effect  of  compressibility,  e.g.  the  term 
pu)  of  the  inner  expansion.  Precisely  speaking,  an  arbitrary  term  C(t)6jj  can  be  added 

to  Pi"\i)  in  the  leading  order  matching.  If  so,  in  the  next  order  0(M3“MJ^“a^)  = 
0(Af“+J),  the  second  term  in  [  ]  of  rhs  of  (12)  will  include  the  additional  term  of  the  form 
CV2(£/2)  —  C /£.  This  arbitrariness  drops  out  when  we  introduce  a  monopole  term  of 
the  form  =  —(M2  /l2)C(t)  by  the  consistency  argument.  Then  the  combined  outer 
solution  —C(t  —  f)/r  -f  V*[C'(t  —  f)/f]  vanishes  identically  [4j.  Matchings  of  the  other 
terms  are  verified  similarly.  In  particular,  in  the  matching  of  the  fourth  terms,  another 
arbitrariness  in  the  dipole  term  comes  in,  but  can  be  dropped  out  by  the  same  reasoning. 

The  origin  of  the  first  monopole  term  of  (10)  is  considered  by  Kambe  [16]  (see  also 
[17,  18]).  It  is  shown  from  the  dynamical  equation  of  motion  (Navier-Stokes  equation) 
that 

m  =  “TiT  7  *(<>  ’  M<)  =  1 /»'(y. «)i*y  («) 

where  K  is  the  total  kinetic  energy  and  7  the  ratio  of  specific  heats  (7  =  7/5  for 
the  air).  We  assume  that  significant  effect  of  the  compressibility  of  the  vortex  motion 
appears  only  in  this  isotropic  term  due  to  the  assumption  of  the  compact  source  flow  and 
M  Cl,  mentioned  in  the  beginning.  The  rest  factors  P, ,  Py  ,  P^k  ,  •  •  ■  of  the  multipole 
components  of  p(w)  are  given  by  Qi ,  Qij ,  ,  •  •  •  associated  with  the  vorticity  or  and 

Ri ,  Rij ,  Rijh  ,  •  •  •  representing  the  influence  of  a  solid  body.  The  functions  Qi ,  Qij  ,  •  •  ■ 
are  integrals  of  the  moments  of  the  vorticity  v(y,  <)  as  given  in  (4)-(6).  Thus  the 
multipole  components  of  the  generated  wave  are  expressed  in  terms  of  the  vorticity 
of  the  source  flow.  This  is  called  as  vortex  sound.  The  the  second  term  of  (10)  represents 
dipole  emission  due  to  change  of  the  total  impulse  4ir Qi  and/or  presence  of  a  solid  body. 


243 


3.  COLLISION  OF  TWO  VORTEX  RINGS 

We  first  consider  the  vortex  sound  in  an  unbounded  fluid  with  no  solid  body,  gener¬ 
ated  by  collisions  of  two  vortex  rings.  In  this  situation,  the  impulse  AnQi  is  conserved, 
and  hence  the  dipole  term  disappears.  In  fact,  the  dipole  emission  (considered  in  the 
next  section)  is  related  to  the  rate  of  change  of  the  resultant  external  force  F^t)  exerted 
on  the  fluid  [13]  which  is  absent  by  the  assumption.  Therefore  Pij  and  are  given 
only  by  Qij  and  Q; ^  respectively.  The  pressure  in  the  acoustic  far-field  (f  — >  oc 
much  simplified  in  its  form  because  the  space  derivatives  applied  to  r~1  in  (10)  becon.e 
higher  order  of  smallness  than  those  applied  to  the  functions  of  tr  as  (r  — >  oo).  Thus 
the  pressure  observed  at  a  point  x  =  (xi,X2,x3)  in  the  acoustic  far-field  is  given  as 


XiXjXk 


where  superscript  (n)  denotes  the  n-th  time  derivative.  The  second  quadrupole  term 
(Mohring’s  quadrupole  [6])  can  be  shown  to  derive  from  the  non-isotropic  part  of  the 
Reynolds  stress  poV;Vj  [16].  The  conservation  of  the  resultant  moment  of  impulse  (an¬ 
gular  impulse)  leads  to  the  symmetry  property  Qij  =  Qji  (and  =  0,  etc.  by  (7)). 
The  first  isotropic  (monopole)  term  arises  when  the  total  kmetic  energy  K  of  the  sys¬ 
tem  changes,  but  vanishes  identically  in  an  inviscid  fluid,  ilere  we  have  written  the 
formula  up  to  the  third  order  terms.  An  experimental  observation  (described  in  §  3.2.) 
is  showing  existence  of  this  order. 

Using  the  length  scale  l,  the  vorticity  scale  u>  =  u/l  and  the  time  scale  r  =  l/u,  the 
scaling  law  of  the  wave  pressure  of  the  quadrupole  sound  is  deduced  as  follows.  The 
tensor  Qij  is  normalized  by  lAu  and  hence  by  l4ix/r3  =  lit4  (the  first  monopole  term 
gives  the  same  by  (13)  ).  Thus  we  find  the  scaling  law  for  the  quadrupole  sound  as 


PQ 


Po  iu4  1  _  Pou 4  l_ 
c2  r  c2  r 


The  sound  intensity  Iq  is  given  by  Pq/pqC.  Hence  we  obtain  the  well-known  intensity 
law  [9]:  Iq  ~  (p0u8 / cs)(l /r)2  oc  it8. 


3.1.  Head-on  collision 

Axisymmetric  collision  of  two  vortex  rings  [17]  is  a  particularly  simple  example  of 
the  vortex  sound.  In  an  inviscid  fluid,  the  first  term  of  the  formula  (14)  vanishes  and 
the  second  quadrupole  term  reduces  to 


Figure  1.  Directivity  of  the  acoustic  Figure  2.  Head-on  collision  of  two  vortex 

pressure:  1  -  3  cos2  6.  rings  (definition  sketch). 
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p  =  _i&Q(3){ir);(1~3cos2^  (15) 

where  the  vorticity  is  assumed  to  have  only  azimuthal  (^-component  u>(<r,  z)  in  the  cylin¬ 
drical  coordinate  system  (z,cr,$),  leading  to 

Q(t)  =  J w(cr,z)  cr2z  d adz  ,  (16) 

with  z  —  x3  and  <r  =  \Jx\  +  x\  and  8  =  arccos (z/r).  In  this  case  the  temporal  behavior 

of  the  pressure  is  described  by  the  single  scalar  function  of  3rd-order  derivative  Q^(t), 
and  spatial  distribution  is  given  by  the  axisymmetric  four-lobe  directivity  (Figure  1). 

(а)  Computation 

Suppose  that  we  have  two  vortex  rings  having  a  common  symmetry  axis  z  with  one 
vortex  being  a  mirror  image  of  the  other  with  respect  to  the  plane  z  —  0  (Figure  2),  and 
that  they  approach  each  other  according  to  the  equation  of  motion.  When  the  vortex  is 
characterized  by  its  strength  — r,  ring  radius  R(t )  (core  radius  6)  and  the  distance  Z(t ) 
from  the  symmetry  plane  z  =  0,  the  interacting  motion  of  two  vortex  rings  of  very  thin 
cores  is  described  by  a  system  of  first-order  ordinary  differential  equations  [19]  originally 
derived  by  Dyson  [20].  Then  the  profile  function  Q{t)  is  expressed  by  —  2TR2{t)Z(t). 
The  wave  pressure  form  is  given  by  which  is  calculated  numerically  by  solving 

this  system  of  differential  equations  and  shown  by  the  curve  I  in  Figure  3,  together  with 
the  observed  one  E  (see  (6)  ).  When  the  two  vortices  come  close  to  each  other,  forward 
speed  \Z(t)\  decreases,  but  the  ring  radius  R(t)  grows  rapidly,  and  the  cores  of  the  two 
vortices  get  closer  to  come  into  contact  [19].  The  first  peak  of  the  profile  corresponds 
to  the  initial  inviscid  stage  of  the  colliding  motion. 

Effect  of  finite  core-size  and  core  deformation  on  the  wave  profile  is  studied  in  detail 
by  the  contour  dynamics  [21].  This  analysis  suggests  that  the  dip  observed  in  the 
curve  — Q^(t)  is  due  to  core  deformation,  although  the  core  size  used  in  the  numerical 
simulation  seems  to  be  larger  than  in  the  experiment  and  the  fluid  is  inviscid. 

A  numerical  simulation  of  axi¬ 
symmetric  viscous  vortex  collision 
was  carried  out  to  estimate  the 
profile  function  shown  by 

the  curve  V  in  Figure  3,  at  the 
Reynolds  number  Re  «  1.3  x  103, 
based  on  the  initial  translation 
velocity  U  =  and  ring  dia¬ 

meter  2i2o  =  2R(0)  [31]. 

(б)  Observation 
Experimental  observations  of 

the  corresponding  acoustic  waves 
due  to  the  vortex  collision  are 
reported  in  [17]  and  [22],  In  the 
latter  study,  the  waves  were 
observed  in  all  directions  at 
0  =  10°  to  350°  with  10°  interval 
in  a  meridional  plane  including 


s' 

Q(3  )(t)  /  1 


Figure  3.  Temporal  profiles  -<?^(t)  (normalized): 
I,  inviscid;  E,  experimental;  V,  viscous  simulation. 
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the  symmetry  axis  i3. 

The  observed  pressures  at  35  stations  are  represented  in  the  truncated  Fourier  series: 
p{6,  t )  =  flo(t)  4-  ai(t)  cos  6  +£>i(i)  sin  6  +  a2(t)  cos  26  +  £>2(0  sin  26.  The  two  terms  ao(0 
and  a2(t)  cos  2 6  are  found  to  be  dominant  and  rewritten  as 

Pmq(^,0  =  Pm(0  +Pq(0(!  ~3C0S2^)  , 

where  pm  and  pq  represent  the  monopolar  and  quadrupolar  components  respectively. 
Significant  amplitudes  of  pm(t)  and  pq(0  are  detected  (Figure  4).  This  experimental 
study  suggests  that  there  exists  monopole  component  associated  with  dissipation  of 
the  kinetic  energy  (13).  The  Reynolds  number  Re  is  of  the  order  104  or  larger  in 
this  experiment.  A  general  feature  of  the  observed  curve  pm  implies,  helped  by  (13)  and 
(10),  that  the  kinetic  energy  K  decreases  like  a  step  function  (or  rather  like  the  function 
—  tanh)  at  the  time  around  t  ~  2000(ps). 

An  experimental  visualization  in  color  was  made  recently  in  water  at  lower  Reynolds 
number  of  the  order  103  [23].  This  clearly  shows,  when  the  two  vortex  rings  are  close  to 
one  another,  reconnection  of  vortex  lines  and  formation  of  smaller  rings  by  instability. 
At  higher  Reynolds  number,  turbulent  disintegration  is  observed. 


1200  1600  2000  2400  (ps) 


Figure  4.  Observed  temporal  profiles  of  pm(i)  and  pq(t). 

3.2.  Oblique  collision 

Oblique  collision  of  two  vortex  rings  at  right  angles  is  studied  experimentally  and 
computationally  [15].  Evidently  this  oblique  collision  (Figure  5)  has  no  axisymmetry 
like  that  in  the  previous  case  of  head-on  collision.  This  requires  an  increased  amount 
of  data  for  the  analysis.  At  the  oblique  collision,  opposite  senses  of  vortex  lines  are 
forced  to  come  in  contact  at  the  inner  part.  This  event  is  followed  by  violent  mo¬ 
tion  of  vorticity  and  excitation  of  acoustic  waves.  Analysis  of  the  wave  data  provides 
some  information  of  the  complex  vortex  motion  associated  with  vortexline  reconnection, 
sometimes  turbulent. 

Consider  a  problem  that  the  initial  state  is  given  in  such  a  way  that  two  vortex  rings 
are  set  to  move  along  the  paths  intersecting  at  right  angles  at  the  origin  and  collide  with 
one  another.  The  bisecting  straight  line  between  the  two  paths  of  the  vortex  center  is 
taken  as  the  polar  axis  6  =  0  (along  the  x3  axis)  of  the  spherical  coordinate  system.  The 
plane  perpendicular  to  the  2:3  axis  is  the  (xi,x2)  plane  on  which  6  =  n/2.  There  are 
two  symmetry  planes  including  the  x3  axis:  one  includes  the  trajectories  of  the  vortex 
centers  which  is  defined  as  (x2,x3)  plane  and  the  plane  (*1,2:3)  perpendicular  to  it  is 
also  a  symmetry  plane  which  bisects  the  two  trajectories.  The  plane  <f>  =  0  is  taken 
along  the  positive  *i  axis.  Thus  the  two  vortex  rings  (centers)  move  toward  the  origin 
along  the  direction  of  the  angles  ( 6,<f> )  =  (7r/4,7r/2)  and  (7r/4,37r/2)  before  colliding 
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Figure  5.  Schlieren  photographs  of  the  oblique  collision  of  two  vortex  tings 
(made  by  shock  impulse  from  the  two  nozzles  seen  above). 

interaction.  From  the  geometrical  arrangement  just  mentioned,  the  acoustic  pressure 
p(Ot<f>,t)  is  characterized  by  the  symmetry: 

p(6,-<t>,t)  =  p{6,4>,i)  ,  p{9,<j>  +  v,t)  =  p(9,<f>,t)  .  (17) 

The  formula  (14)  is  rewritten  by  using  the  spherical  polar  coordinates  (r,9,<fi):  Zj  — 
r  sin  9  cos  <f>,  *2  =  r  sin  0  sin  <f>,  xa  =  r  cos  9.  Then  the  n-th  order  form,  like  the  expression 
JFa  =  CijXiXj/r 2  for  n  =  2  or  Fa  =  CijkXiXjXk/r3  for  re  =  3,  can  he  represented  in  terms 
of  the  n-th  order  (or  lower  order)  spherical  harmonics, 

^n(0,  ^n(0(c°s^>  sin<£),  •••,  P"(C)(cosn<£,  sin n<f>),  (18) 

where  (  =  cos 9,  P°(C)  and  P*( £),  (k  —  l,>*-,n)  are  the  Legendre  polynomials.  In 
fact,  we  have  F 2  =  (l/3)( — Cn  —  C22  ■+■  2l/33jP2j  ■+■  (i/6)(C7ii  —  ^22/^2  cos  2^/  -1-  ^  linear 
combination  of  P3  cos  <f>,  P2  sin  <f)  and  P2  sin  2<f>  ).  Similarly,  regarding  the  third-order 
form  F3,  we  give  an  explicit  expression  to  the  terms  to  be  used  later: 

F3  =  -(2Q333  —  Q113  —  <$223)^3  +  2q(<?113  —  Qz23)Pl  cos  ‘^4>  4-  •  •  •  (19) 

where  Q\^\  are  used  in  place  of  C,jk ,  and  the  tilde  symbol  denotes  Qm  =  4-  4- 

Qm,  etc..  The  coefficient  of  P3°  of  (19)  reduces  to  Q^3  -  (l/5)Q3*i  by  the  relations  (7). 
In  view  of  the  symmetry  (17),  the  pressure  is  represented  as 

p(6,<f>,t )  =  A0(t)  +  J41(t)P2°(cos  4-  A2(t) P* (cos  9)  cos  2<j> 

+Bi(t)Pa(cos0)  +  Bi(t)P3(cos 9) cos2(f>  ,  (20) 

where  higher  order  terms  are  omitted  since  observed  amplitudes  are  not  significant. 
Thus  it  is  found  that  the  far-field  acoustic  pressure  (14)  is  represented  in  terms  of  the 
five  normal  modes  with  five  coefficient  functions  of  time  [-4o(0>  Ai(t),  A2(t),  Bi(t),  P2(*)]- 
Here  the  Legendre  functions  are  P2°  =  (l/2)(3cos2#  -  1),  P}  =  3sin20,  •••,  and 
P°  =  (l/2)(5cos3  <?  —  3cos0),  P32  —  15(cos 9  -  cos3  9),  . 
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(a)  Observation 

The  acousttic  waves  emitted  by  the  90°  collision  were  detected  at  102  different 
angular  positions  on  the  three  great  circles  of  radius  r  =  620mm  on  the  three  orthogonal 
planes:  (1)  <£  =  7r/2,37r/2;  (2)  <f>  =  0,7r;  (3)  9  =  x/2  .  The  trajectories  of  the  vortex 
cores  in  the  (x2,x3)  plane  observed  by  a  photosensor  are  shown  in  Figure  6,  where  the 
ring  radius  jRq  =  R(0)  of  the  single  (unperturbed)  vortex  is  4.7mm.  (  U  —  27m/s) 


vortex  ring  x  vortex  ring 


Figure  6.  Observed  core  trajectories  of 
two  colliding  vortices  in  the  (x2,x3) 
plane  including  the  vortex  center. 
The  marked  positions  a  and  b 
correspond  to  the  times  of  Figure  7 
(broken  lines). 


Figure  7.  Main  mode  amplitudes  of 
observed  waves. 


The  symmetry  relation  (17)  is  found  to  be  consistent  with  the  observed  data.  In 
fact,  the  acoustic  pressure  detected  in  the  plane  9  ~  7r/2  can  be  expanded  into  Fourier 
series  with  respect  to  the  angle  <£,  and  it  is  found  that  the  Fourier  coefficients  of  sinm<£ 
(to  =  1  ~  4)  and  cos  m'<$  (to'  =  1,3, 4, 5)  are  negligible. 

Three  sets  of  profile  functions  of  \A0,  A^,  A2,  Bi,  B2]  can  be  determined  from  the 
observed  data.  It  is  found  that  they  coincide  almost  with  each  other.  These  profiles  are 
shown  in  Figure  7. 

( b )  Computation 

Numerical  simulation  of  the  vortex  collision  at  right  angles,  which  corresponds  to  the 
experiment,  was  carried  out  by  M.  Takaoka  in  order  to  estimate  the  acoustic  emission, 
using  the  method  of  [24].  The  incompressible  Navier-Stokes  equation  is  solved  numeri¬ 
cally,  together  with  the  continuity  equation,  by  the  spectral  method  on  643  grid  points. 
Imposed  boundary  conditions  are  2n  periodicity  in  the  three  coordinate  directions. 

The  isotropic  component  is  proportional  to  the  second  time  derivative  of  the  kinetic 
energy,  and  the  auadrupole  components  and  higher  modes  are  related  to  the  change  of 
moments  of  vorticity  distribution.  Using  the  data  from  the  simulation,  we  can  calculate 
the  tensors  Qij(t)  and  Q,y*(t)  of  (5)  and  (6).  Thus  we  readily  obtain  the  main  mode 
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coefficients  [A0,  Ai,  A2,  Bi,  B2\.  The  main  mode  amplitudes  of  the  computation  and 
the  observation  are  qualitatively  similar,  but  quantitatively  different,  probably  because 
of  the  difference  of  the  values  of  the  Reynolds  number.  Unfortunately,  the  Reynolds 
number  of  the  computer  simulation  is  smaller  by  two  orders  of  magnitude  than  that  of 
the  experiment. 


(c)  Implication  of  dominant  wave  components 

Observed  amplitudes  Ax  and  A2  of  the  two  quadrupoies  P2°  and  Pf  cos2<j>  in  (20) 
are  significantly  large.  The  amplitudes  and  .42(t)  are  considered  as  consisting  of 

three  longitudinal  quadrupoies  Qn,Q22,Qi3  (to  the  directions  xi,x2,x3),  as  understood 
from  the  form  of  the  coefficients  of  F2.  It  is  probable  that  there  exists  non-negligible 
amplitude  in  the  monopole. 

Out  of  the  two  third-order  components,  the  amplitude  B2  of  Pf  cos  2$  is  negligibly 
small,  but  the  appearance  of  the  mode  P3  is  substantial.  To  see  the  significance  of  this 

mode  Pg,  we  examine  its  coefficient  B i  which  is  given  by  —  -  Q3ti(0'  where 

Qszs  =  ~  j{y  x  <*>)3y32d3y,  j (y  x  w)3y2d3y  , 

p3  ~  (y  x  cu)3  =  yiw2  -  y2w i  =  y.  x  , 

the  vectors  y,  =  ( y\,y2 )  and  tu.  =  (u;i,w2)  being  projection  of  y  and  u>  to  the  plane 
(xi,x2).  Restricting  our  attention  to  the  location  of  vortexline  reconnection  at  the  inner 
part  of  the  collision  and  reminding  the  geometry  of  the  vortexlines,  it  is  expected  that 
the  variable  p3  (near  y,  =  0)  changes  its  sign  before  and  after  the  reconnection,  and 
that  its  rapid  change  will  give  substantial  contribution  to  the  terms  <2333(t)  and  <?&(<)• 
Thus  it  is  suggested  that  the  significant  amplitude  B}(t)  observed  in  the  experiment 
(see  Figure  7)  is  representing  an  acoustic  signal  of  a  local  violent  motion  a*  the  inner 
part  of  the  collision.  The  observed  acoustic  pressure  show's  a  characteristic,  asymmetric 
directivity  at  the  times  when  the  Bx  mode  is  substantial  (see  Figure  8).  As  a  test 
observation,  we  took  a  shadowgraph  (Figure  9)  of  the  w'aves  generated  by  the  core 
collision  with  the  vortex  speed  much  higher  than  that  of  the  acoustic  measurement.  It 
is  supposed  that  the  w’ave  generation  corresponds  to  the  phase  a  of  Figure  8. 


Figure  8.  The  asymmetric  directivity 
of  the  acoustic  pressure  at  the  times 
a  and  0  marked  in  Figure  7  : 
positive  (o),  ncgative(«) 


Figure  9.  Waves  generated  by  the  core 
collision  at  a  higher  vortex  speed 
(taken  hv  T  Minot  a) 
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4,  INTERACTION  OF  A  VORTEX  RING  WITH  A  BODY 

Presence  of  a  solid  body  in  a  flow  causes  a  considerable  increase  in  the  intensity 
of  generated  sound.  The  degree  of  increase  depends  on  the  boundary  conditions.  If 
the  body  is  finite  with  its  dimension  being  small  compared  with  a  typical  wavelength, 
then  the  sound  wave  generated  by  nearby  eddies  is  characterized  by  a  dipole  emission 
whose  intensity  is  proportional  to  the  sixth  power  of  the  typical  velocity  u.  On  the 
other  hand,  the  intensity  of  sound  generated  by  eddies  in  the  vicinity  of  a  sharp  edge 
of  a  flat  semi-infinite  plate  (considered  in  the  next  section)  is  proportional  to  the  fifth 
power  of  u  and  the  wave  field  takes  a  cardioid  directivity  pattern.  [  see  [7]  for  general 
formulation;  [25]  for  a  circular  cylinder;  [26]  for  a  sphere  ;  [27]  for  a  wedge.] 

Solenoidai  vortex  motion  in  the  neiborhood  of  a  solid  body  at  rest  is  dominated  by  a 
dipole  field.  This  is  interpreted  as  follows.  The  first  and  second  terms  of  (8)  disappear 
by  the  conditions  of  vanishing  net  outflux  across  the  body  surface  and  no  translational 
motion  of  the  body  [14],  respectively.  However,  the  impulse  47rQ,  (the  coefficient  of  the 
second  term  of  (3)  )  changes  according  to  the  momentum  exchange  between  the  body 
and  the  surrounding  fluid.  The  rate  of  change  of  47r<5,(t)  is  given  by  the  resultant  force 
Fi(t)  exerted  on  the  fluid  by  the  relation,  47r Qi  —  Fx  (see  [19]). 

It  is  shown  [7]  that  the  temporal  wave  profile  is  determined  by  the  following  integral 
which  is  a  function  of  time  t: 

ni(t)  =  J  “(y,t)  ■  ^,(y)dzy  (21) 

where  is  the  vorticity  of  the  source  flow  and  the  function  ^(y)  (i  =  1,2,3)  is  a  vector 
potential  (with  div  =  0)  for  the  velocity  of  an  imaginary  potential  flow  (around  the 
body)  of  a  unit  velocity  at  infinity  in  the  i-th  direction.  The  function  ^,(y)  represents 
the  presence  of  the  body  in  the  vicinity  of  the  vortex  motion. 


4.1  General  formula 

If  the  eddy  and  the  body  without  sharp  edge  are  both  compact  relative  to  the 
acoustic  wavelength,  the  acoustic  pressure  obtained  by  Curie  [  1 3]  is  expressed  as 


p(x,f) 


(22) 


in  the  far-field  where  —F,(t)  is  the  force  exerted  on  the  body  by  the  surrounding  fluid, 
represented  by  an  integral  of  the  stress  over  the  body  surface.  The  expression  (22)  is 
clearly  a  dipole  wave  with  its  axis  in  the  direction  of  F,(<) 

A  different  expression  of  the  acoustic  pressure  is  obtained  by  Obermeier  [28]  and 
Kambe  [7]  for  the  same  problem.  The  pressure  generated  by  the  interaction  of  a  solid 
body  and  a  closed-loop  vortex  (a)  as 


p(x,t) 


.■(!--) 

47 rc  t1  c 


(23) 


where  the  function  fl,(t)  of  (21)  reduces  to 


11,(0 


r  f  *i-  ds  r  x  *.)  ■  n  d 5  =  rj.fo 


(21) 


The  constant  F  is  the  strength  of  the  vortex,  the  infinitesimal  variables  ds  and  dS  are 
a  line  element  of  the  closed  loop  c  and  an  elemnt  of  the  open  surface  S  bounded  by 
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cr,  respectively,  and  n  is  a  unit  normal  to  the  surface  S.  The  function  Ji(<)  denotes  the 
volume  flux  of  the  potential  flow  ^  passing  through  the  loop  <r. 

To  obtain  the  scaling  law  of  the  dipole  emission,  we  note  that  II;  is  normalized  by 
ul3  since  T  ~  ul  and  JV  x  $,j  being  of  order  unity  away  from  the  body.  Thus  the  scaling 
law  is 

p0  ul3  1  pQu3  l 

Pd  ~ - 7~  ~ - ■ 

c  t*  r  c  r 

The  sound  intensity  is  Id  ~  (pou6  /  c3)(l  /  t)2  <x  u6. 

4.2  A  vortex  passing  nearby  a  circular  cylinder 
(a)  Computation 

Consider  a  circular  vortex  ring  of  radius  R  (core  radius  6)  passing  by  the  side  of  an 
infinite  straight  circular  cylinder  of  radius  a  (Figure  10).  The  sc3  axis  is  taken  along 
the  cylinder  axis.  (In  this  case,  $3  denotes  a  uniform  flow  to  the  cylinder  axis.)  If 
the  vortex  is  sufficiently  distant  from  the  cylinder,  the  vortex  path  can  be  regarded  as 
rectilinear.  It  is  assumed  that  the  vortex  ring  keeps  its  circular  form  (approximately) 
and  the  ring  center  moves  within  the  plane  (xi,x2),  which  means  that  the  normal  to 
the  ring  plane  lies  within  the  (21,2:2)  plane.  The  above  expression  (23)  reduces  to 


/  / 

flnu/  1? 


Pob,(<£,  £) 


t  =  1695(ps) 


ii-f  -“•! 


Jj  * 

*/f/ 


/  (« 


Figure  10.  Schematic  diagram  of 
the  problem  and  the  flow  F 
to  the  observation  direction. 


Figure  11.  Dipole  character  of  the  far-field 

acoustic  pressure  p(<t>,t)  and  corresponding 
vortex  position  relative  to  the  circular 
cylinder  at  three  times,  where  U  =  27m/s, 
R  ■  4.7mm  and  a  -  4.5mm  . 


251 


p(x,£4--)  =  •— -T^-^  [Ji(£)cos<£  +  Jz(i)  sin  <f>]  (25) 

c  47 rc  r 

=  £5jr^j(<)  co.(*-e(i))  (26) 

where  J2  —  Jf  +  J|  and  tan0  =  J2/Ji,  since  J2  is  taken  to  be  zero  and  'J’i  (t  =  1,2) 
has  only  the  S3  component:  (0,  0,  $;).  The  angle  <f>  denotes  the  azimuthal  angle  of  the 
projection  of  the  position  vector  x  on  the  (sj,  s2)  plane,  measured  from  the  axis.  The 
angle  8  is  the  polar  angle  of  the  direction  *  from  the  s3  axis.  Evidently  the  directivity 
of  the  acoustic  emission  (26)  is  of  dipole  character  with  its  axis  in  the  direction  0(£) 
determined  by  and  J2.  The  coefficient  J(t)  is  the  second  time  derivative  of  the 
volume  flux  J(t)  passing  through  the  vortex  ring  of  the  flow  F  around  the  cylinder 
to  the  direction  of  observation  <f>  ( [25],  Figure  10).  Note  that  the  wave  pressure  (26) 
vanishes  toward  the  cylinder  axis  0  —  0  and  7 r.  Thus  the  non-compactness  of  the  body 
to  the  direction  of  the  cylinder  axis  does  not  Influence  the  characteristic  feature  of  the 
wave  field. 

(6)  Observation 

Corresponding  acoustic  waves  were  detected  at  various  angles  <f>  in  the  plane  8  =  90° 
and  various  angles  8  in  the  plane  (f>  ~  0°  and  180°.  Average  wave  observed  at  every  10° 
position  (in  the  plane  8  =  90°)  is  expanded  into  Fourier  series  with  respect  to  <f>.  It  is 
found  that  the  main  component  is  given  in  the  form, 

Pob.(<£,  0  =  sin  <j)  4-  b(t)  cos  (j>  , 

in  accordance  with  the  expression  (25),  and  the  dipole  character  is  clearly  seen  in  Figure 
11.  The  solid  curves  a,b  in  figure  12  are  the  observed  profiles.  The  broken  curves  show 
computed  (normalized)  amplitudes  a  =  —  J2  and  b  —  J\  for  the  vortex  ring  moving  along 
a  straight  line  parallel  to  the  x2  axis  but  with  observed  variable  speed  (the  velocity 
decays  gradually).  Agreement  in  absolute  values  between  the  observed  and  predicted 
profiles  are  fairly  good. 


Figure  12.  Comparison  of  the  coefficients  o(i)  and  b(t ):  observation  (solid)  and 
computation  with  6/ R  =0.15  (thick  broken)  and  0.1  (thin  broken). 
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5.  INTERACTION  OF  A  VORTEX  RING  WITH  AN  EDGE  OF 

A  FLAT  PLATE 

Presence  of  a  fiat  plate  with  a  sharp  edge  in  the  vicinity  of  a  vortex  alters  the  wave 
field  significantly,  as  well  as  the  inner  field  bounded  by  the  plate.  When  the  edge  plate 
is  non-compact,  the  temporal  wave  profile  depends  on  the  history  of  the  vortex  motion, 
which  is  the  scattering  effect  of  the  edge.  This  effect  is  taken  into  account  by  using 
the  low-frequency  Green’s  function  [5],  and  the  formulation  is  made  differently  from  the 
previous  ones  ([29],  [30],  [10]). 

In  the  present  case  of  a  vortex  ring  moving  near  the  edge  of  the  plate  (Figure  13,  the 
edge  being  taken  as  the  *3-axis,  the  polar  axis),  the  plate  is  placed  at  y2  =  0  (yi  <0) 
or  <f>  =  ±180°.  The  wave  pressure  generated  by  this  interaction  is  obtained  [7,  29]  as 


p(*,f) 


po  sin  \<f>  (sin#)1/2  _  r 

(27r3c)1/J  r  c 


(27) 


where  f(t)  =  ~  f  ^  da  ,  11(f)  =  T  jf  (V  x  *)  ■  nd S,  (28) 

the  latter  expression  is  obtained  from  (21)  like  (24)  with  'i '*  =  ’$'  =  (0,0,V’(y))-  The 
streamfunction  j>(y)  is  defined  by  —Y2!2  cos  1 <f> ,  representiong  a  hypothetical  potential 
flow  around  the  edge  from  below  to  above  the  plate,  where  Y  =  (y2  +  yl)1^2.  Note 
that  the  velocity  V  x  $  of  the  hypothetical  flow  is  scaled  as  L~1/2  (L  being  the  nearest 
distance  to  the  edge),  and  that  the  time  scale  for  the  flux  change  should  be  given  by 
tl  =  L/u.  Using  the  argument  which  is  similar  to  the  case  of  dipole,  we  obtain  the 
scaling  law  of  the  wave  pressure  for  the  edge  sound  as 


po  ulL~l'H2  \  poU*t2(l\2l 

PE~  c1/J  T3/J  r"  c1/2  r' 

The  acoustic  intensity  Ie  defined  by  p2E/poC  is  now  proportional  to  u~  L  *.  The  angular 
dependence  of  the  wave  pressure  (27)  is  given  by  the  function  F(8,<f>)  —  sin  (sin  9)* 


Figure  13.  Trajectory  of  the  vortex 
ring  moving  near  the  edge  and  the 
streamlines  given  by  ^  =const. 


Figure  14.  Three  functions,  f(t),  fl(t),  fl(t) 
for  the  angle  of  trajectory  a  =  —~7 r. 


BH 


mmm 


a  —  — w/2 


b  —  9.6  mm 


V  —  30.4m/s  R  —  4.7ram 


Figure  15.  Polar  plot  of  the  acoustic  pressure  Figure  16.  Perspective  profile  of 

observed  at  a  fixed  time  t0  with  0  —  irj 2.  the  observed  wave: 

The  solid  curve  is  /exp(fo)  sin  \<j>.  P.{4>,  f)  =  /«p(0  sin  \4>- 

(Figure  15).  In  the  plane  perpendicular  to  the  x3  axis  (B  —  7r/2),  the  directivity  of  the 
acoustic  intensity  is  the  cardioid,  I $  ~  sin2  on  a  fixed  circle  r  =const  and  0  =const. 

(a)  Computation 

The  temporal  wave  profile  of  the  pressure  is  represented  by  /(f)  of  (28)  : 


1  r°o  !  . 

fW~7r~  I  (— u^)513f(u;)  exp[-iu;tj  dw  = 
2  7T  J —  oo 


sj 


[26],  where  II(t)/r  is  the  volume  flux  of  the  flow  tJ)  passing  through  the  vortex  ring 
of  radius  R  (core  radius  5 ),  and  IIf(u>)  is  the  Fourier  transform  of  fl(<).  The  vor¬ 
tex  trajectory  is  a  straight  line  with  the  angle  a  to  the  yx  axis.  The  three  functions 
/(f),  n(f),  11(1)  are  shown  in  Figure  14  for  a  —  -tt/2.  It  is  observed  that  the  curves 
/(f)  are  intermediate  between  11(f)  and  11(f).  This  intermediate  behavior  exhibited  by 
the  half-integer  differential  operation  (d/df)z  is  related  to  the  scattering  of  the  wave  by 
the  non-compact  edge  plate.  It  is  interesting  to  compare  this  with  11(f)  of  (23)  for  the 
compact  case. 

( b )  Observation 

The  sound  emission  Turn  a  vortex  ring  travelling  near  the  edge  of  a  flat  plate  along 
a  nearly  straight  path  was  investigated  in  a  laboratory  experiment  [29].  Both  of  the 
spatial  profile  F(0,<^>)  and  temporal  profile  /(f)  were  found  to  agree  with  the  predicted 
ones  described  above. 

The  pressure  p(d>,  f)  observed  in  the  plane  6  —  90°  is  expressed  by  a  truncated  Fourier 
series.  It  is  found  that  the  main  component  is  of  the  form  p,(<f),t)  —  /exp(f )  sin  |<^  in 
accordance  with  (27).  Experimental  study  was  done  mainly  for  the  vortex  path  of 
a  =  — 7t /2.  Figure  15  is  a  polar  plot  of  the  pressure  p.(cf>,t)  at  a  fixed  time  f0:  p.{4>,  fo)- 
A  perspective  plot  of  the  pressure  profile  p.(<f>,t)  is  illustrated  in  figure  16.  The  half- 
plane  lies  at  B  =  ±180°,  on  both  sides  the  pressure  takes  opposite  signs.  The  vortex 
moved  from  the  positive  to  negative  y2-ax is  with  the  translation  velocity  V  =  30.4m/s, 
R  =  4.7mm  and  L  —  9.6mm. 

Figure  17  show's  comparison  between  the  observed  curve  and  computed  curves  on 
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U  (m/s) 

Figure  17.  Comparison  of  /exp(f)  and  Figure  18.  Observed  power  laws 

corresponding  computed  curves.  of  A p  (Pa)  vs.  U  (m/s). 

the  absolute  scales.  The  solid  curve  is  the  experimentally  determined  profile  ftx p(t)  and 
the  broken  curves  are  the  computed  ones  of  p/F(<f>,6  —  90°)  obtained  by  using  (27,  28) 
for  the  relative  core  size  6/R  —  0.2  (thin  broken)  and  0.3  (thick  broken).  Agreement  of 
the  two  curves  is  remarkable. 

6.  CONCLUSION 

Concerning  the  problem  of  the  vortex  sound,  a  new  formulation  is  developed  for  a 
compact  source  flow  of  localized  vorticity  distribution,  based  on  asymptotic  expansion 
of  the  velocity  potential  of  the  inner  flow.  This  is  successfully  applied  not  only  to 
the  sound  generation  by  two  interacting  vortices  in  free  space,  but  also  to  the  case 
of  wave  generation  by  a  vortex  interacting  with  a  compact  external  body.  The  third 
case  of  the  wave  generated  by  a  vortex  moving  near  a  non-compact  edge  plate  must  be 
treated  differently,  taking  account  of  the  scattering  effect.  In  each  of  the  three  cases,  the 
computed  wave  profiles  show  remarkable  agreement  with  the  observed  ones,  implying 
validity  of  the  theory  of  vortex  sound.  Figure  18  is  a  summary  of  our  experiment, 
illustrating  observed  power  laws  (in  the  log-log  plot)  of  the  acoustic  pressure  (peak) 
amplitude  A p  versus  the  translation  velocity  U  of  a  single  vortex  (in  isolated  state). 

In  the  section  3.1  of  head-on  collision  of  two  vortex  rings,  a  consistent  account  is 
given  by  relating  the  phases  of  the  vortex  collision  with  those  of  the  wave  profiles.  A 
new  feature  is  noticed  concerning  asymmetric  wave  emission  in  the  oblique  collision  in 
the  section  3.2. 

Finally,  some  remarks  are  to  be  made  about  other  successful  comparisons  between 
observation  and  computation,  which  are  out  of  scope  of  the  present  article.  Acoustic 
noise  of  dipole-type  is  investigated  recently,  based  on  the  computer  simulation  of  a 
uniform  flow  around  an  aerofoil,  and  compared  with  corresponding  experiment  at  M = 
0.23  and  Re  =  3  x  105  (with  reference  to  the  chord  and  the  freestream  velocity)  [32]. 
Fair  agreement  is  found  between  the  two  noise  spectra  obtained  from  the  computer 
simulation  and  the  experiment.  The  computed  data  shows  structure  of  the  surface 
dipole  over  the  wing  surface.  Another  study  [33]  of  a  shock  wave  interacting  with  a 
vortex  filament  also  revealed  reasonable  agreement  between  the  observation  and  the 
calculation. 
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Abstract 

Nonlinear  membrane  theory  can  be  regarded  as  an  approximation  to  the  nonlinear 
behavior  of  shells  of  finite  thickness  under  suitable  conditions,  or  else  studied  as  a 
mathematical  model,  complete  in  itself,  for  shells  which  are  formally  assumed  to  be  in¬ 
capable  of  carrying  moments.  Selected  topics  of  interest  in  the  statics  of  nonlinear 
elastic  membranes  will  be  discussed.  These  include:  the  basic  equations,  the  behavior  of 
a  membrane  adjacent  to  a  stressed  initial  state,  the  undeformed  membrane,  boundary  con¬ 
ditions,  membrane  edge  effect,  lines  of  rotational  discontinuity,  axisymmetric  mem¬ 
branes,  cylindrical  membranes,  wrinkle  fields  and  others.  The  intrinsic  form  of  the 
shell  equations  will  be  adopted  in  most  of  the  discussions.  Cases  with  small-strain  geo¬ 
metric  nonlinearities  as  well  as  some  finite  strain  problems  will  be  considered. 
Emphasis  will  be  put  more  on  general  and  formulative  issues  rather  than  on  specific 
solutions,  but  the  analysis  of  a  specific  case  will  be  outlined. 


1.  INTRODUCTION 

Membrane  theory  is  commonly  regarded  as  an  approximation  to  the  shell  problem, 
obtained  by  formally  suppressing  the  stress  couples  and  transverse  shears  in  the  shell 
equations,  compared  with  the  tangential  force  resultants  ("membrane  forces").  The 
suppression  takes  place  in  the  force  equilibrium  equations,  force  boundary  conditions, 
energy,  and  moment  equilibrium  around  the  normal  to  the  shell.  There  is  ample  experi¬ 
mental  and  theoretical  evidence  to  the  effect  that  membrane  theory  is  a  valid  "interior" 
approximation  to  the  shell  equations,  which  dominates  the  behavior  far  from  boundaries 
and  discontinuities  (either  in  the  geometry  or  the  loading),  for  deformation  processes 
with  a  typical  length  measure  V  of  the  reference  surface,  and  provided  that  there 
exists  adequate  support  at  the  shell  boundaries.  Processes  with  length  measure  v'rF, 
where  t  is  the  shell  thickness  (such  as  buckling,  short-wavelength  vibrations,  edge 
effects),  ar.d  also  those  involving  substantial  inextensional  deformations,  are  to  be 
treated  by  the  more  general  bending  theory.  Also,  bending  corrections  ■  to  the  membrane 
solution  which  are  based  on  its  deformation  patterns  can  be  calculated.  These  can  be 
used  to  estimate  the  accuracy  of  the  membrane  approximation  and  to  correct  the  stress 
levels  (which  are  of  a  lower  order  in  t/r).  Chapters  on  membrane  theory  can  be  found  in 
many  standard  books  on  shell  theory  [1-13].  Some  consider  nonlinear  behavior  speci¬ 
fically  [7-14]. 

Another  approach  to  membrane  theory  regards  membranes  as  two-dimensional  deformable 
sheets,  plane  or  curved,  which  are  formally  incapable  of  carrying  any  stress  couples  or 
transverse  shears,  and  can  sustain  tangential  force  resultants  only.  These  are  the 
"ideal”  or  "true"  membranes  (the  term  "ideal"  membranes  will  be  used  to  distinguish  them 
from  "shell  membranes"  which  can  carry  stress  couples,  as  previously  discussed).  The 
theory  of  ideal  membranes  can  be  formally  modelled  as  a  complete  entity  in  itself 

Several  important  characteristics  distinguish  the  behavior  of  ideal  membranes  from 
shell  membranes.  To  name  a  few: 
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(1)  Ideal  membranes  cannot  sustain  compressive  stresses,  whereas  shell  membranes  can 
carry  compressive  stresses  until  buckling  takes  place. 

(2)  Ideal  membranes  exhibit  "wrinkle  fields"  or  "tension  fields''  to  compensate  for  the 
nonexistence  of  compressive  stresses.  In  shell  membranes,  postbuckling  behavior  is 
out  of  the  scope  of  the  model,  but  the  approximation  of  an  "incomplete  tension 
field"  can  be  sometimes  used  to  investigate  the  transition  from  postbuckling  to 
wrinkle-fields  [15]. 

(3)  Ideal  membranes  can  accept  lines  of  angular  discontinuity  in  the  deformation 
pattern  on  the  boundary  or  in  the  interior.  In  shell-membranes,  angular  continuity 
is  preserved. 

(4)  In  the  same  vein:  existing  lines  of  angular  discontinuity  in  an  ideal  membrane  can 
be  smoothed  out  under  suitable  conditions.  In  shell-membranes,  the  theory  breaks 
down  near  such  lines. 

(5)  Large  inextensional  deformations  can  be  sustained  in  an  ideal  membrane,  but  not  in 
a  shell  membrane. 

(6)  A  large  variety  of  kinematic  and  force  boundary  conditions  can  be  imposed  on  an 
ideal  membrane  within  the  scope  of  the  model  (with  obvious  exceptions  -  such  as 
imposed  stress  couples  or  transverse  shears).  Exact  edge  conditions  can  rarely  be 
specified  for  a  shell-membrane.  Bending  or  3D  theories  are  used  for  edge  effect 
analysis,  with  component  3D,  bending  and  nonlinear  membrane  effects. 

(7)  The  "undeformed  shape"  of  an  ideal  membrane  needs  only  a  metric  specified  in  a  2D 
(curved)  space  ua(a=l,2)  plus  a  specified  closed  line  in  this  space.  It  can  be 
"packed  in  a  box".  A  shell  membrane  starts  from  a  3D  thin  walled  body  which  con¬ 
stitutes  its  undeformed  shape.  Not  all  the  geometrical  properties  of  this  shape 
appear  in  the  equations,  but  they  are  necessary  for  the  analysis  of  its 
limitations. 

The  question  as  to  whether  ideal  membranes  do  actually  exist  as  physical  entities 
need  not  be  answered:  the  analyses  of  mechanics  problems  involve  some  degree  of  ideal¬ 
ization  and  modelling  of  the  behavior  of  the  physical  world,  and  the  success  of  a  model 
should  be  measured  by  its  ability  to  predict  this  behavior  in  some  approximate  sense. 
With  this  in  mind,  several  classes  of  shell  problems  can  be  modelled  as  ideal  membranes. 
Among  those,  the  most  important  is  the  asymptotic  behavior  of  thin  shells  as  (t/r) 
approaches  zero.  In  fact,  ideal  membrane  behavior  can  be  arrived  at  solely  by  this 
limiting  process.  Other  classes  of  problems  which  can  be  modelled  as  ideal  membranes 
include  shells  made  from  special  materials  (biological  membranes,  cloth  and  other  thin 
films,  inflatables),  problems  involving  large  loads  of  the  pressure  variety,  many  large 
strain  problems,  etc. 

In  what  follows,  no  direct  distinction  between  the  two  types  of  membranes  will  be 
made.  The  typing,  if  needed,  should  be  based  on  the  problem  at  hand.  For  example,  com¬ 
pressive  stresses  will  be  included  in  the  analysis,  but  problems  related  to  wrinkle 
fields  will  also  be  discussed.  Boundary  conditions  for  an  ideal  membrane  will  be  looked 
at,  with  the  understanding  that  in  a  shell  membrane  this  may  be  just  a  component  of  a 
more  complex  edge  effect  problem. 

A  "real"  shell  is,  of  course,  a  thin  3D  continuum.  Shell  theory  is  an  approximation 
to  3D  behavior.  Membrane  Theory  is  an  approximation  to  shell  theory  and,  putting  aside 
special  cases,  the  behavior  of  ideal  membranes  is  a  subset  of  Membrane  Theory.  However, 
if  the  parameters  and  objectives  are  right,  the  latter  can  give  very  good  predictions  of 
the  behavior  of  this  3D  body  under  conditions  involving  strong  nonlinearities  which, 
otherwise,  would  have  been  extremely  difficult  to  treat. 


2.  NONLINEARITIES  IN  MEMBRANE  THEORY 


These  follow  the  usual  classification  into  geometrical  and  material  nonlinearities. 
Small  strain  theory  with  ncnlinearities  due  to  the  inextensional  component  of  the  defor¬ 
mation  and  to  local  strain  gradients  is  the  more  common  case.  Tension  field  theory  can 
be  included  within  this  class  since  the  wrinkling  process  involves  mostly  geometrical 
changes.  A  special  subclass  of  the  above  is  "rigid-inextensional"  theory  which  assumes 
that  the  strains  are  zero  and  that  deformation  can  take  place  through  inextensional 
and/or  wrinkle  fields  only. 

The  relative  simplicity  of  the  application  of  two-dimensional  hyperelastic  consti¬ 
tutive  relations  to  ideal  membranes  has  led  to  the  development  of  the  materially  non¬ 
linear  large  strain  membrane  theory  which,  in  addition  to  many  practical  uses,  can  be 
applied  to  problems  involving  the  loss  of  stability  in  tension  due  to  combined  material 
and  geometrical  effects. 

The  main  feature  that  nonlinearity  adds  to  membrane  theory  is  shape-adaptability: 
both  geometry  and  membrane  forces  participate  simultaneously  in  taking  on  the  applied 
loads.  This  is  facilitated  by  considering  equilibrium  with  respect  to  the  deformed  con¬ 
figuration.  The  process  gives  rise  to  a  sixth  order  partial  differential  system  which 
can  accommodate,  in  most  case,  three  boundary  conditions.  This  should  be  contrasted  with 
linear  membrane  theory  which  is  taken  with  respect  to  an  undeformed  geometry  and  can 
accommodate,  in  principle,  only  two  boundary  conditions.  In  some  cases  of  well  supported 
membranes  (such  that  the  boundary  conditions  prevent  inextensional  deformations),  linear 
membrane  theory  yields  good  predictions  away  from  the  boundary.  In  such  cases,  the  corr¬ 
esponding  small  strain  nonlinear  theory  predicts  the  formation  of  boundary  layers  of 
thickness  iV?  at  the  edges,  where  £  is  a  typical  strain  measure  of  the  membrane. 

A  well  studied  group  of  problems  which  belong  to  this  class  is  the  axisymmetric  de¬ 
formation  of  membranes  of  revolution.  However,  as  the  strain  increase,  the  boundary 
layers  are  obliterated  and  boundary  effects  extend  over  a  large  part  of  the  membrane. 
See  section  5  for  more  details. 

At  the  other  extreme  lie  cases  where  almost  all  of  the  deformation  is  inextensional 
and  the  contribution  of  the  strains  is  minor.  For  a  trivial  but  illuminating  example, 
consider  a  rectangular  ideal  membrane  sheet  in  the  xy  plane,  clamped  along  its  edge  x=0 
and  loaded  along  its  edge  x=a  by  a  uniform  line  load  Pz  in  the  z  direction.  In  its  de¬ 
formed  shape,  the  sheet  rotates  through  90°  and  a  line  of  angular  discontinuity  forms 
along  x=0.  The  addition  of  normal  pressure  causes  the  deformed  shape  to  be  curved  but 
the  essential  feature  of  almost  inextensional  deformation  is  preserved. 

The  "spherical  barrel"  example  of  section  7  serves  to  demonstrate  some  of  these 
cases.  This  is  a  truncated  ideal  spherical  membrane  attached  to  rigid  rings  which  are 
pulled  apart  by  forces  P.  The  membrane  collapses  in  to  form  a  cylindrical  wrinkle  field. 
For  small  P,  "rigid-inextensional"  theory  holds  with  the  wrinkles  extending  clear  to  the 
rings.  For  larger  P,  a  narrow  unwrinkled  axisymmetric  boundary  layer  is  formed  adjacent 
to  the  rings.  For  larger  P,  the  boundary  layer  no  longer  exists.  The  unwrinkled 
stretched  membrane  which  covers  a  large  part  of  the  surface  is  analyzed  by  axisymmetric 
large  strain  theory. 


3.  A  REVIEW  OF  THE  MEMBRANE  EQUATIONS 
(a)  Notations 

Tensorial  component  notation  is  adopted  in  the  general  discussion.  Physical 
components  are  used  in  examples.  Vector  notation  is  used  as  needed,  referred  to  a  fixed 
Cartesian  coordinate  system  x,y,z.  The  membrane  is  to  be  analyzed  in  three  possible 
states:  Current  state  (denoted  with  an  upper  bar),  undeformed  state  (without  bars),  and 
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reference  state  (denoted  with  a  subscript  or  superscript  (o)).  The  latter  is  a  known 
deformed  state  with  respect  to  which  incremental  solutions  are  sought. 

In  the  sequel,  a  comma,  a  double  vertical  bar,  and  a  single  vertical  bar,  denote 
partial  and  covariant  differentiations  with  respect  to  the  current  and  reference  states, 
respectively.  Upper  case  letters  denote  full  quantities  and  lower  case  letters  denote 
incremental  quantities  (from  the  reference  state).  A  membrane  is  identified  either  by 

the  position  vector  R(ua)  of  material  points  on  its  deforming  reference  surface,  or  by 


its  current  metric  and  curvature  tensors  A^p  and  Bap,  respectively,  which  satisfy  the 
three  continuity  equations  of  Codazzi-Mainardi  and  Gauss  [16,17].  Associated  quantities 


are  A=det(Aap),  permutation  tensor  eap,  unit  normal  N  and  Christoffel  symbols  Tp^,  given 

by  Tpy^A0^  A^  p+A^j3  y’^pY  corresP°n^‘n8  quantities  in  the  undeformed  and 

reference  states  are  R,  A^p  etc.  and  R^°\  A^j^  etc.,  respectively. 

The  boundary  of  a  membrane  is  a  curve  T  with  arclength  parameter  s:T:  u  =  u  (s), 

where  ds^  -  Aapduadu^. 

Unit  vectors  along  the  curve  are  X,  v,  N  where  X.a=u^-  and  va=epa^- 
The  normal  (Kn)  and  geodesic  (K  )  curvatures  along  the  curve  are: 


V  r-  =  Bap^P  Kg  =  =  ^P(^-7  +  re/0) 

n 

The  symmetric  force  resultant  tensor  acts  on  elements  of  the  current  membrane,  in 
current  directions  and  is  measured  in  terms  of  the  current  metric.  Kirchhoff 
("engineering")  resultants  act  in  the  same  directions  but  use  the  undeformed  metric. 

They  are  given  by  /  A  N°^=  J  ~K  Na^. 

The  loading  vector  per  unit  deformed/undeformed  area  is  p=p  R,a+pN/p=p  R,a+pN. 
Components  with  respect  to  undeformed  directions  (f^f)  or  to  cartesian  directions 
(f* e-)  are  occasionally  used. 

Two-point  tensors  and  vectors  associated  with  the  geometry  of  the  deformation  are  the 
Green  Strain  Tensor  Eap,  change  of  curvature  tensor  K^p  and  displacement  vector  V.  Hiese 

are  given  by  2Ea(}=Aa|3-Aa|3;  K^B^B^;  V=R-R=vaR,a+wN=v'ei. 


(b)  A  Review  of  the  Basic  Equations 

The  important  issues  are  the  choice  of  variables  and  choice  of  directions.  The 

intrinsic  form  utilizes  Aftp  and  Bap  as  geometric  field  variables  and  relates  the 

equations  to  the  current  directions.  The  displacement  form  utilizes  the  displacements 

(v°\w)  as  field  variables  and  relates  the  equations  to  the  undeformed  directions.  Other 
variants  are  possible  too.  The  equations  are  occasionally  related  to  extrinsic 
directions  such  as  the  axial  and  radial  directions  in  membranes  of  revolution  or 
Cartesian  directions.  The  intrinsic  form  is  preferred  in  the  sequel. 
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(b.l)  Equilibrium 

The  fundamental  form  is  the  vector  equation  of  equilibrium  {/  S’ Ta),a+-/  "A  p=0  where 


Ta=Na^R,p  is  the  stress  resultant  vector. 


Alternate  component  forms  with  respect  to  the  current  directions  are:  N  Hi  a+p  =0; 


-  MaP„  4.^=n- 


NaPBap+P=° 


or 


/ 


Component  forms  with  respect  to  the  undeformed  directions  utilize  the  "displacement 
gradients"  and  "rotations"  defined  by  ^{xp=vaj  p-B^w;  9a='(w>a+Bapv^-  The  equations 
are  [18]: 


[K  +  S7p]N<1la  + 


(b.2)  Kinematics  and  Compatibility 

The  Aap  and  B^p  must  satisfy  the  continuity  (compatibility)  equations  of  surface 
theory.  These  are  [16]: 

e^Y(®ap,y  *  ^ay®Xp]  =  0  Codazzi  equations 

e“l'e|i6[Sa(iiYS  +  AapiY5  +  V^s]  -  0  Ga'JSS  “luaIion- 

In  the  case  of  small  strains,  simplified  equations  are  available,  which  are  linear  in 
the  strains  and  their  first  order  derivatives  [19,20,14].  See  section  4(a). 

For  the  displacement  formulation,  the  strain-displacement  equations  are  [18]: 

E«p  “  z(Sap  +  s(Sa  +  saS7p  +  Vp]' 


(b.3)  Constitutive  Relations: 

Only  hyperelastic  materials  are  considered.  Let  «l>(Eap)  be  the  strain  energy  density 
per  unit  undeformed  area.  Then  Na^=4  ff-S —  + — ] .  Methods  for  obtaining  have  been 

ZlsEap  3EapJ 

discussed  elsewhere.  See,  for  example  Green  and  Adkins  [10],  Wu  [21],  Libai  and  Simmonds 
[13].  Approximate  strain  energy  functions  for  many  biological  membranes  have  been 
obtained,  although  these  apply  under  restricted  conditions  only  [22,23]. 

"Rigid-inextensional"  behavior  where  all  the  E  p  are  set  to  zero  (except  for 


wrinkling)  is  also  a  constitutive  relation. 

While  the  assumption  of  dependence 

membranes,  this  is  not  necessarily  true  in  "shell  membranes",  where,  in  the  most  general 


While  the  assumption  of  dependence  on  the  E^p  only  appears  to  be  valid  for  ideal 
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case,  dependence  on  the  K^p  might  be  expected  too.  However,  to  conform  with  common 
usage,  the  possible  dependence  on  the  Kap  will  be  ignored  in  the  sequel. 

(b.4)  Boundary  Conditions 

These  come  out  of  the  virtual  work  identity  for  the  deformed  membrane 


jT*-$Vds  +  JJp8Vda  =  JjTa-SV,ada 
T  a  a 

where  T*  is  the  applied  force  per  unit  length  and  8V  is  the  virtual  displacement. 
Integration  by  parts  yields  the  work  form  of  the  equilibrium  equations  and  also  the 

boundary  conditions  (T*-Tava)-6V=0. 

The  natural  conditions  in  component  form  are  T*  =  T*X  =  Na^v  T*  =  T*-v  =Na^v  vn 

r  s  a  p  v  a  p 

and  T*-N=|T*jcos(T*,N)=0.  The  first  two  conditions  equate  the  applied  shearing  and 
normal  forces  m  the  tangent  plane  to  the  corresponding  stress  resultants.  The  third  is 

a  condition  on  the  field  variable  N  itself  (since  the  direction  of  T*  is  known).  It  is 
an  equilibrium  condition,  but  it  sets  up  a  kinematic  requirement  on  the  direction.  At  a 

free  edge,  T*=0  and  only  two  conditions  can  be  applied.  This  agrees  with  the  nature  of 
the  membrane  equations,  as  will  be  seen  later.  The  vector  form  of  the  boundary 
conditions  can  also  be  related  to  undeformed  directions,  leading  to  natural  conditions 
in  the  displacement  formulation  [18]: 

he +  ■  T*  S 


-  <PpNa(iva  =  T*  N 


Again,  if  T*=0,  then,  from  the  first  equation  Na^va=0  and  the  second  equation  is 
identically  satisfied,  so  that  no  condition  on  <Pp  can  be  set 


The  kinematic  conditions  emerge  from  8V=Q.  In  the  displacement  formulation  they 
require  the  specification  of  the  displacements  v  ,w  from  the  undeformed  configuration. 

Taken  with  respect  to  the  current  state,  the  conditions  8va=0,  8w=0  may  be  used  in 
problems  with  evolving  kinematic  conditions.  More  commonly,  they  are  translated  into 
conditions  on  the  intrinsic  quantities  on  the  boundary:  the  extensional  strain  e  ,  the 


normal  curvature  K 


and  the  geodesic  curvature  K  .  For  example,  if  the  boundary  does  not 

& 


deform,  then  eg=0,  K 


K=K 
n  n 


In  many  engineering  applications,  the  curvatures  and 

strain  are  the  obvious  data. 

A  more  general  approach  relates  the  natural  and  kinematic  conditions  to  the  behavior 
of  a  surrounding  "boundary  beam",  where  the  ordinary  differential  equations  of  the  beam 
become  the  boundary  conditions  of  the  membrane  [24],  It  is  also  possible  to  utilize  var¬ 
iational  formulations  for  a  direct  derivation  of  the  boundary  conditions  in  intrinsic 
form.  As  a  word  of  caution,  it  must  be  emphasized  that  the  boundary  conditions  strictly 
apply  to  ideal  membranes  only.  In  shell-membranes  the  bending  component  of  the  boundary 
conditions  is  always  present  to  some  degree.  Also,  compressive  boundary  loads  may  cause 
wrinkling  in  ideal  membranes. 


(c)  Angular  Discontinuities 

Conditions  of  kinematic  and  static  continuities  exist  across  internal  lines  in  the 
membrane:  The  deformed  position  and  stress  resultant  vectors  must  be  continuous,  and, 

unless  the  normal  force  across  the  line  vanishes ,  the  unit  normals  N  must  coincide.  The 
latter  can  also  be  obtained  from  the  elementary  observation  that  membrane  equilibrium 
cannot  be  locally  maintained  in  the  presence  of  an  angular  discontinuity  with  a  non  zero 
tensile  force  (see  Fig.  1).  This  leads  to  two  conclusions:  (a)  preexisting  lines  of 


A.  Not  in  equilibrum 


B.  Shell  Membrane 


C.  Ideal  Membrane 


Figure  1.  Angular  discontinuity  with  tensile  force. 


angular  discontinuity  are  smoothed  out  by  a  nonzero  tensile  force,  (b)  If  the  force 
vanishes,  then  a  line  of  angular  discontinuity  is  permitted  (even  if  it  did  not  pre¬ 
exist).  As  before,  the  above  is  strictly  valid  for  ideal  membranes  under  tension  only. 
In  shell  membranes  the  membrane  state  breaks  down  near  the  Sine  and  bending  effects  in¬ 
termix.  Hence,  the  immediate  region  near  the  fold  should  be  taken  out  of  the  analysis. 
Yet,  the  procedure  of  disregarding  very  local  bending  zones  is  common  in  the  analysis  of 
sufficiently  thin  highly  loaded  shell  membranes.  They  can  be  thus  analyzed  as  ideal 
membranes. 


4.  THE  NATURE  OF  THE  MEMBRANE  PROBLEM 
(a)  Behavior  of  the  Equations 

The  membrane  field  equations  form  a  sixth  order  nonlinear  partial  differentia] 
system.  Compared  with  linear  theory,  the  nonlinear  terms  raise  the  order  from  four  to 
six.  They  occur  in  the  highest  derivatives  and  involve  the  current  state  of  stress  of 
the  membrane.  Consequently,  the  membrane  may  behave  in  radically  different  ways, 
depending  on  the  nature  of  the  state  of  stress  and  deformation  in  the  neighborhood  of 
different  solutions.  A  helpful  and  convenient  method  for  the  study  of  the  system  in  a 
specific  region  of  its  variable-space  is  to  expand  it  around  a  solution  point 
("reference  state")  in  the  region,  and  consider  the  perturbation  series  from  this  point. 
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Properties  near  this  state  can  r>e  gleaned  from  the  linear  terms  in  the  expansion  in  the 
sense  of  "small  deformations  superposed  on  large"  which  has  many  applications  in 
Nonlinear  Mechanics  [9,25].  In  membranes  it  has  been  used  to  investigate  stability  and 
vibrations  [18,26-29]  but  has  additional  practical  applications  in  cases  where  a 
solution  is  sought  which  is  "not  far"  in  some  sense  from  a  known  state  [18,24,14,30]. 

Ut  (°j  den°.e  *e  reference  slate.  Let:  e^E^-E^*.  k^B^-B^. 
naP=ivjaP-N“P^  Ap  ,  Ap  be  the  increments  from  the  reference  state.  Restrictions  are 
imposed  that  the  eap  and  Yp^  be  sufficiently  small  such  that  linearization  can  be  per¬ 
formed  in  these  variables,  and  that  no  wrinkling  takes  place.  Substitution  into  the 
field  equations  yields  the  following  set  of  incremental  equations: 


^la  +  N?!)4  +  V3  =  0 


"“M?  +  N?o)k«|3  +  "“V  *  AP  =  0 

£pY<kap|Y-Bip^Y»  =  ° 

*&*>■ ♦**!«*  ♦$  k<x|v  v;  - 0 


noP  =  caPrS  U  1«»evS 

*S  l  EaB'EySJ  * 


r  L  ^ 

where  Kq  is  the  Gaussian  curvature  of  the  reference  state  (o),  and  all  tensorial  opera¬ 
tions  are  performed  in  this  state.  For  quadratic  strain  energy  densities,  the  is 

the  linear  elastic  tensor.  Since  the  kftp  are  not  assumed  to  be  small,  nonlinear  (under¬ 
lined)  terms  are  retained  (in  fact,  were  the  (o)  system  chosen  as  the  undeformed  state, 
then  a  small-strain-finite-rotation  membrane  theory  would  have  resulted!).  In  the  above, 

the  cofactor  tensor  is  defined  by  f^se^e^f  g  and  the  incremental  Christoffel  tensor 

is  given  by  its  linearized  form: 

a  X  ,  X  X 

YpY  "ea  P  ep  a  eap 

For  a  complete  linearization,  the  underlined  terms  should  be  dropped.  Henceforth,  Apa=0 
is  assumed  for  simplicity.  To  facilitate  further  study,  a  D.M.V.-type  approximation  is 
made  by  replacing  tangential  equilibrium  and  compatibility  with  a  stress  function  F  and 
curvature  function  G: 


n«P  _  p  |«P  +  R 


k«P  =  Gl 


ap  +  Rk 


where  the  Rj  are  lower  order  remainder  terms.  This  approximation  has  a  wide  range  of 

applicability  in  shell  theory.  For  the  present  study  it  suffices  to  note  that  it  is  also 
useful  for  a  qualitative  evaluation  since  it  preserves  the  highest  derivatives  of  the 
system  (however,  more  exact  substitutions  are  available,  as  will  be  seen  later). 
Furthermore,  the  incremental  constitutive  relations  are  assumed  to  be  isotropic  with 
effective  moduli  Et  and  v  (this  is  exact  for  quadratic  strain  energies).  Substitution 


and  rearrangement  yield: 

L^(F)  +  Ln(Ci)  +  Ap  =  (normal  equilibrium) 

■jlj  V4(F)  +  L^(G)  =  Rg  (compatibility) 

Here,  are  remainder  terms  involving  lower  order  derivatives  and  the  differential 
operators  L^,  and  V4  are 

AaB 

L^(0  =  B(Q)f  ap  =  "curvature  operator" 

Ln(f)  =  N^j^f  ap  =  "stress  operator" 

V4(0  =  f|a^ap  =  "biharmonic  operator" 

Further  elimination  yields  the  nonlinear  membrane  equation: 

m  s/m  -  '  4<vp>  =  rf- 

For  the  corresponding  displacement  formulation,  see  Budiansky  [18]. 

Discussion: 

(1)  The  system  is  always  twice  elliptic,  but  the  third  operator  depends  on  the 
reference  state  of  stress.  This  influences  the  boundary  conditions  which  can  be 
assigned.  At  a  free  edge,  for  example,  the  stress  operator  is  parabolic  and  only  two 
conditions  can  be  assigned  (see  previous  discussion). 

(2)  The  operator  ^  Ln  is  of  the  order  of  the  strains  Eq  of  the  reference  state.  If 
it  is  small  compared  with  rL^  then  a  boundary-layer  type  solution  is  to  be  expected  of 
size  iV*T\  This  breaks  down  if  eo=0(l)  or  if  L^(F)=0  (in  the  latter  case  are  included 
problems  of  linear  membrane  theory  where  the  boundary  conditions  are  inappropriate  for 
H  [2]). 

(3)  The  number  and  arrangement  of  boundary  conditions  does  not  depend  on  L^,  in 

contrast  to  linear  theory,  where  it  controls  the  behavior.  However,  for  small  strains 
and  appropriate  boundary  conditions,  L,  is  dominant  away  from  the  edges. 

0  aB 

(4)  If  an  unloaded  state  is  chosen  as  reference,  then  N.^O  so  that  L  =0.  The 

( o)  n 

equations  uncouple  and  reduce  to  the  fourth  order  linear  membrane  system  which  can 
accommodate  only  2  boundary  conditions  and  depends  heavily  on  L^.  Singular  perturbation 

techniques  may  be  needed  to  start  a  nonlinear  problem  from  this  state. 

Choice  of  reference  state: 

(1)  A  convenient  reference  state  is  one  which  has  a  simple  geometry  and  can  be  easily 
analyzed.  It  need  not  even  have  the  same  boundary  data  as  either  the  undeformed  or  final 
states.  For  example,  a  pressurized  circular  membrane  is  a  good  choice  for  problems 
involving  oval  or  incomplete  cylinders. 
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(2)  In  cases  with  sufficiently  rigid  boundaries  and  small  strains,  the  of  the 

linear  membrane  solution  plus  the  of  the  undeformed  state  can  be  used  for 

approximate  analysis.  In  shell-membranes  this  has  the  advantage  of  easy  merging  with 
bending  effects  where  appropriate. 

(3)  Obviously,  the  chosen  state  should  be  sufficiently  "close"  to  the  final  solution 
in  order  to  ensure  meaningful  results  (easier  said  then  done). 

(b)  Dependence  on  the  initial  shape 

A  study  of  the  intrinsic  form  of  the  equations  shows  that  the  only  initial  data  which 
is  formally  needed  is  the  metric  Aap  (which  appears  in  the  constitutive  relations)  and 

ot  oc 

the  definition  of  the  boundary  T:u  =u  (S).  All  other  properties  of  R,  including  the 
undeformed  curvatures  Bap  do  not  appear  in  the  equations.  The  boundary  data  and  loading 

are  usually  related  to  the  final  form  (e.g.  pressure)  or  to  extrinsic  directions  (e.g. 
axial  loads  in  a  shell  of  revolution),  but  rarely  to  the  initial  shape.  It  follows  that 
all  membranes  having  the  same  Aap  and  T  should  have  the  same  final  state.  This  includes 

the  class  of  surfaces  which  are  isometric  to  the  initial  surface,  including  those  with 
any  number  of  angular  discontinuities  (folds).  For  all  practical  purposes,  the  membrane 
can  be,  initially,  even  "packed  in  a  box". 

The  displacement  formulation  requires  an  initial  surface  to  measure  the  displacements 
from.  There  is  some  arbitrariness  in  the  choice  since  any  surface  from  the  isometric  set 
can  be  chosen.  If  the  initial  metric  plus  kinematic  boundary  data  ensure  the  existence 
of  a  unique  Bap  in  the  class  of  smooth  functions,  then  the  resulting  surface  is  commonly 

chosen  for  R.  It  should  be,  however,  noted  that  this  is  an  artifice,  since  the  boundary 
data  is  provided  for  the  final  state  only,  and  a  deformation  of  the  boundary  between  the 
initial  and  final  states  is  possible.  To  cite  an  example:  the  shapes  of  the  ends  of  a 
closed  pressurized  cylindrical  membrane  are  assigned  to  be  plane  ellipses.  Yet,  the 
initial  shapes  can  be  circular,  with  specified  edge  displacements  as  kinematic  boundary 
conditions.  This  choice  of  initial  shape  might  be  easier  to  solve  than  the  choice  of  an 
initial  elliptical  shape. 

A  shell-membrane  does  have  a  unique  undeformed  body  to  start  from.  The  need  for  it 
arises  whenever  an  interaction  with  bending  theory  is  required.  However,  it  is  not  a 
part  of  the  formal  membrane  data,  unless  it  is  chosen  as  the  initial  state. 


(c)  Wrinkling  and  Tension  Fields 

Very  thin  shells  and  other  shells  having  low  bending  rigidities  can  support  little 

compressive  stress  before  they  buckle.  If  such  shells  are  pulled  in  one  direction  and 

compressed  in  the  other  (such  as  when  subjected  to  shear),  a  large  number  of 

high-aspect-ratio  buckles  forms  with  creases  oriented  in  the  tension  direction.  As  the 
bending  rigidity  goes  down,  so  do  the  buckling  stress  and  the  distance  between  crease 
lines.  In  ideal  membranes,  the  instability  stress  approaches  zero  and  the  number  of 
buckles  becomes  very  large,  thus  forming  a  typical  "tension  field",  which  is  not  an 
uncommon  sight  in  very  thin,  flexible  membranes.  The  assumed  inability  to  carry 

compressive  stresses  is  common  to  all  ideal  membranes.  It  has  been  initially  used  by 
Wagner  [31]  to  examine  the  postbuckling  behavior  of  thin  plates  in  shear,  and  was  shown 
by  Steigmann  [32]  to  be  exact.  Several  investigators  undertook  the  study  of  tension 
fields  in  plane  membranes  [33-38].  Wrinkled  biological  membranes  •  were  studied  by 
Danielson  et  al.  [39].  Curved  wrinkled  membranes,  which  are  of  more  interest  in  this 
review,  were  studied  by  Kondo  [40,41],  Wu  [42,43],  Zak  [44]  and  others.  A  more  extensive 
study  of  the  foundations  of  plane  and  curved  tension  field  theory  was  initiated  by 
Piplan  [4bj,  Steigmrmn  [46]  and  in  collaboration  [47,48].  A  numerical  approach  was 


developed  by  Roddeman  et  al.  [49].  The  instability  due  to  wrinkling  of  bent  pressurized 
cylindrical  membranes  was  investigated  by  Comer  and  Levy  [50].  Koga  [ 27 ]  and  Lukasiewicz 
et  al.  [51].  The  ponding  instability  of  partly  wrinkled  spherical  membranes  was  analyzed 
by  Glockner  and  his  associates  [52-56],  Croll  [57]  has  studied  the  formation  of  wrinkles 
in  transversely  loaded  plates  and  Libai  analyzed  boundary  layers  adjacent  to  wrinkles 
[58].  The  list  is  not  exhaustive  and  will,  in  all  probability  expand  in  the  near  future 
as  partial  wrinkle  fields  will  be  routinely  incorporated  into  nonlinear  membrane 
analysis.  Also,  more  effort  should  be  put  into  "incomplete  tension  fields''  which 
describe  the  transition  from  shell  to  wrinkled  membrane,  and  into  the  wrinkling  of 
biological  membranes. 

The  underlying  idea  of  tension  field  theory  is  to  replace  'he  complex  wrinkled 
surface  (with  its  random  form  of  detailed  wrinkles)  with  a  smoothed  out  "pseudo  surface" 
[43]  or  "wrinkle  surface”  which  could  be  imagined  as  a  continuous  distribution  of 
wrinkles  or  fibers,  and  constitutes  a  regular  surface  in  space.  This  is  equivalent  to 
the  concept  of  a  generalized  curve  or  surface  in  the  calculus  of  variations  [59],  The 
pseudo-surface  should  satisfy  the  equilibrium  equations,  with  one  of  its  principal 
stress  resultants  being  positive  and  the  other  set  to  zero.  For  convenience,  a 
non-material  orthogonal  coordinate  system  which  coincides  with  the  principal  stress 

directions  is  employed.  Nontensorial  notation  is  used,  where  Nj  is  the  nonvanishing 
principal  resultant,  are  unit  vectors  in  the  principal  directions  and  A^=v  X  Then: 
T1  =  N,e,.  'f-O,  e|  |=  - -t- 2e2  +  !- A|N.  P=P(«)e0  <  pN. 

A2  7I 

llie  vector  and  component  equations  of  equilibrium  become,  respectively: 

(NjAjCjJ.j  +  A,A2p  =  0 

^2^1>’l  ^iA-jP(j)  —  0 

-  AUN,  +  A,A2p(2)  =  0 

—  Nj  +  p  =  0 
F1 

where  both  Nj  and  the  geometry  (A|,A2,r|)  are  unknowns.  The  nature  of  the  loading 
determines  the  properties  of  the  stress  trajectories: 

(1)  If  P(2)=0’  then  e  ^  |  is  parallel  to  N  and  the  trajectories  are  geodesies. 

(2)  If  p=0,  then  -~0  and  the  lines  are  asymptotic  (with  K<0). 

?1 

(3)  If  both  are  zero,  then  ej  j=0.  The  lines  are  straight,  and  the  surface  is  ruled. 

(4)  If  p^y=0,  then  the  force-per-elemenl  (NjA2)  is  constant.  This  is  analogous  to 
flow  lines  in  fluid  flow. 

The  geometric  variables  (A  j  ,A2>r  j  ,r-,,r  p)  are  connected  through  the  continuity 
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equations  of  Codazzi  and  Gauss. 

To  tie  the  solution  to  the  rest  of  the  membrane  and  to  the  initial  metric,  consti¬ 
tutive  relations  are  needed.  Only  isotropic  membranes  will  be  considered  at  this  stage 
(non-isotropic  membranes  were  studied  by  Mansfield  [60]  and  Roddeman  [49]).  Here,  the 
principal  directions  of  stress  and  strain  coincide.  The  condition  N'o=0  can  be  used  to 

eliminate  the  true  principal  strain  lE^  from  the  constitutive  relations,  which  leaves 


Nj=f(Ejj).  The  difference  between  the  apparent  strain  £2^  of  the  pseudo-surface  and  the 


true  strain  is  the  "pseudo  strain" 


22‘ 


It  was  introduced  by  Wu  [43]  and  is  used 


as  an  extra  variable  in  lieu  of  the  extra  equation  ^=0  which  was  imposed  on  the  field 


equations.  The  constitutive  relation  does  not  add  an  additional  variable,  but  should  be 
used  to  relate  the  tension  field  to  the  material  system.  If  there  are  no  additional 
symmetries,  the  problem  may  be  complex  [46]. 

Other  methods  for  constitutive  modification  where  employed  by  Reissner  [33]  who  used 
a  special  orthotropic  material  law  and  Stein  et  al.  [34]  who  introduced  a  variable 
Poisson  ratio.  More  recently,  Pipkin  [45]  introduced  a  "relaxed"  energy  density  to 
account  automatically  for  the  modified  constitutive  relations. 

The  boundary  conditions  with  the  unwrinkled  portions  of  the  membrane  are  obtained 
from  continuity  requirements: 

(1)  The  resultant  Nj^O  is  continuous  across  the  boundary. 


(2)  N2=0  as  the  boundary  is  approached  from  the  unwrinkled  side. 


(3)  If  Nj^O,  then  angular  continuity  is  preserved. 

(4)  Positional  continuity  is  maintained,  thus  implying  E^^O- 

Examples:  See  section  5  for  the  axisymmetric  case.  See  section  7  for  the  edge  zone 


near  a  wrinkle  field.  Further  investigations  and  details  may  be  found  in  the  literature 
cited  above. 


5.  AXIMEMBRANES 

These  are  membranes  whose  undeformed  shapes  are  surfaces  of  revolution  and  are  axi- 
symmetrically  deformed.  Because  of  their  relative  "simplicity"  they  have  been  studied 
extensively  in  literature  and  solutions  to  many  nonlinear  problems  exist.  Hence,  their 
discussion  here  will  be  limited  in  scope.  More  information  and  details  can  be  found  in 
the  literature  and  some  have  been  gathered  in  chapters  in  books.  Examples  are  Green  and 
Adkins  [10]  who  discuss  constitutive  relations  and  some  finite  strain  problems,  and 
Libai  and  Simmonds  [13,  chapter  5]  who  made  a  more  extensive  survey  with  examples. 

The  reference  surface  of  the  undeformed  membrane  is  defined  in  a  fixed  cylindrical 
coordinate  system  (r,9,z)  by  its  meridional  position  vector  R=r(s)er+z(s)ez  where  s  is 

arclength.  The  angle  a  between  R,  and  e  satisfies  r,  =cosa;  z,  =sina.  The 

s  r  s  s 

corresponding  quantities  in  the  deformed  meridian  are  r,  z,  a.  A  nontensorial  approach 
is  adopted.  Meridional  and  circumferential  stretches  are  defined  by  and  Xg=^, 

respectively.  Stress  resultants  are  Ns  and  Ng  (or  Ng  and  Ng  per  unit  undeformed  length). 
The  loading  vector  is  P=PHer+Pvez  and  may  depend  on  the  field  variables.  The  equilibrium 


equations  are 
(rNssina),s  +  rpy  =  0 


(rNscosa),s  =  Nq  -  rpH 


In  the  case  of  uniform  pressure  p  one  may  also  use  the  partly  integrated  form: 

-  [ - 2  - 

rNs$ina  =  j  p  r  -  Cj  (rNs),s  =  Ngcosa 

To  these  are  added  the  equations  of  geometry  and  constitution: 

s  =  <rV*s =  Vos“  s =  Vin“ 


Ns  =  Ne  = 

s  0 

where  <I>(A.s,Xq)  is  the  strain  energy  density.  For  boundary  conditions,  the  position  or 

stress  resultant  vectors  should  be  specified,  subject  to  overall  equilibrium.  The  system 
is  4th  order  (but  can  be  usually  reduced  to  second  order  plus  two  quadratures).  It  is  a 
two-point  b.v.p.  which  requires  appropriate  numerical  procedures  (e.g.  Runge  Kutta  plus 
shooting)  or  approximate  techniques  for  solution. 

Pressurized  wrinkled  aximembranes  with  meridional  tension  fields:  Here  Nq=0.  Further 

integration  yields  rNs=r  Ns=c2_  The  meridian  of  the  pseudo-surface  can  be  expressed 
explicitly  in  terms  of  the  Jacobi  elliptic  function  dn  with  modulus  k: 

r  =  ^  v42/  p  dnj^  / p/c2  (s  -  sQ)j  where  k2  =  2(1  +  Cj/c^'1 

To  relate  the  result  to  the  undeformed  surface,  the  equations  Nq=0  and  Nsr(s)=c2  are 
used  in  the  constitutive  relations  to  eliminate  Xq  and  express  X^  in  terms  of  s.  This 

establishes  s(s)=Asds.  If  stretching  is  neglected  (X2=l)  then  s=  s.  The  constants  c- 

are  obtained  from  the  edge  conditions  of  the  wrinkled  zone. 

Aximembranes  cannot  deform  inextensionally:  any  radial  displacement  and  any  nonuni¬ 
form  axial  displacement  produce  straining  (an  exception  is  near  the  apex  where  a  local 
semi-inextensional  deformation  is  possible).  However,  if  wrinkles  are  permitted,  then 
inextensional  deformation  become  possible  and  even  common  [13,42,52-56].  Barring 
wrinkles,  it  follows  from  small  strain  moderate  rotation  theory  that  linear  theory  is 
valid  interior  solution  for  many  aximembranes,  with  narrow  boundary  layers  of  size  iVe 
added  to  account  for  support  conditions.  Many  studies  of  this  phenomenon  exist  [61-65]. 
The  use  of  nonlinear  correction  for  internal  continuity  finds  its  use  in  toroidal 
membranes  [66,67].  In  problems  involving  large  strains,  boundary  layers  usually  do  not 
exist,  and  problems  associated  with  inflation  and  stability  are  the  more  important 
topics.  Constitutive  relations  assume  a  major  role  and  combine  with  the  evolving 
geometry  to  produce  limit  point  [10,68]  or  bifurcation  type  [69-72]  instabilities. 

Aximembranes  can  be  classified  by  their  undeformed  shapes.  Within  each  class  the 
small-strain  problem  is  studied  separately.  Some  cases  will  be  briefly  reviewed: 

Plane  Circular  Membranes:  The  small  strain  moderate  relation  theory  is  due  to  Foppl 
[73].  A  less  restrictive  theory  is  due  to  Reissner  [74],  Many  solutions  for  pressure 
and/or  end  loaded  circular  and  annular  membranes  are  found  in  the  literature.  Some  were 
summarized  by  Libai  and  Simmonds  [13].  More  have  recently  appeared  [75-79].  In  the  case 
of  large  strains,  inflation  [10,80-83]  and  deformation  under  concentrated  loads  [84-86] 
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are  some  of  the  studies  of  interest.  The  inverse  problem  of  deformation  into  a 
predetermined  inflated  shape  has  also  been  considered  [81,82,87,88], 

Spherical  Membranes :  Small  strain  problems  lead  to  the  study  of  edge  effects.  Shallow 
spherical  membranes  were  treated  by  Dickey  [76].  The  common  large  strain  problem  is  that 
of  the  inflation  of  spherical  balloons.  Of  special  interest  is  the  possible  existence  of 
a  maximum  pressure.  In  volume-controlled  problems,  a  further  increase ;  in  volume  would 
lead  to  a  decrease  in  the  pressure,  this  trend  may  reverse  itself  at  higher  volumes,  and 
is  strongly  material  dependent  [10,69].  The  bifurcation  of  a  spherical  balloon  into  a 
nonspherical  shape  has  also  been  studied  [69-72]. 

Circular  Cylindrical  Membranes :  In  the  small  strain  range  the  main  issue  is  the  edge 
effect  due  to  kinematic  constraints  or  nonaxial  end  loads.  Also  if  the  end  load  is 
radial,  a  small  part  of  the  cylinder  deforms  into  an  annulus,  while  the  rest  remains 
cylindrical.  At  the  line  of  angular  discontinuity,  Ns=0  [13,62].  The  large  strain 

problem  has  the  first  integral  [89]  <b-Xs<t>,^  =C,  where  C  is  a  constant  of  integration. 

s 

This  has  been  used  to  reduce  the  large  strain  problem  of  pressurized  cylindrical 
membranes  to  quadratures  [13,90,91].  The  stretching  of  a  cylindrical  membrane  attached 
to  rigid  end  rings  has  also  been  studied  [82,90].  Finally,  as  in  spherical  membranes, 
the  possibility  of  a  maximum  pressure  exists  here  too  [92,68]. 

Toroidal  Membranes:  The  important  issue  in  the  small  strain  theory  for  circular 
toroidal  membranes  is  adjustment  of  the  apparent  discontinuity  at  the  apex.  This  was 
accomplished  by  Jordan  [66]  and  by  Sanders  and  Liepin  [67].  Problems  involving  large 
strains  were  treated  by  Kydoniefs  and  Spencer  [93].  The  major  difficulty  in  nonsymmetric 
pressurized  toroidal  ideal  membranes  (where  r(a=0)*r(a=7t)>  lies  in  the  fact  that  a 

necessary  condition  for  equilibrium  is  r(a=0)=r(a=Jl).  This  must  hold  regardless  of  the 
magnitude  of  the  pressure.  Slight  asymmetries  can  be  taken  care  of  by  local  nonlinear 
rotations  which,  as  pointed  out  by  Kuznetsov  [94],  are  nearly  inextensional  at  a=0,7t. 
However,  if  the  asymmetry  is  substantial,  then  large  strains  are  required  to  take  place 
even  if  the  pressure  is  very  small.  This  "toroidal  paradox"  was  used  by  Goldenveizer  [2] 
in  discussion  of  the  existence  of  membrane  solutions,  however,  if  wrinkling  is  accepted 
?«  a  part  of  the  deformation  process,  then  a  solution  might  be  possible,  and  should  be 
tried.  To  account  for  axial  equilibrium,  the  tension  field  must  be  meridional  (Nq=0) 

wherever  it  occurs.  In  addition,  wherever  Ns=0,  a  circumferential  line  of  angular 

discontinuity  is  permitted.  The  membrane  is  unstable  at  its  undeformed  form,  so  that 
large  scale  wrinkling  takes  place  even  at  low  pressures. 

For  the  shape  of  the  membrane  in  unwrinkled  regions,  the  general  equations  should  be 
used,  but  if  the  pressure  is  low,  the  extensional  strains  can  be  neglected  and  the 
forces  calculated  by  linear  membrane  theory.  In  the  wrinkled  regions,  the  theory  for 
wrinkled  axisymmetric  membranes  [43,47]  applies,  and  the  shape  can  be  calculated  without 
difficulty,  as  was  seen  earlier  in  this  section. 

Large  Strain  Asymptotic  Behavior :  In  the  case  of  very  large  strains  a  simplified 
asymptotic  analysis  can  be  made.  Of  some  interest  is  the  possible  tendency  of  closed 
inflated  aximembranes  to  assume  a  spherical  shape  as  the  pressure  and  stretches  become 
very  large.  This  was  shown  for  a  class  of  materials  of  the  Mooney  type  by  Isaacson  [95], 
Wu  [21]  and  Sagiv  [96].  Antman  and  Calderer  [97]  investigated  the  restrictions  that  must 
be  put  on  the  energy  density  of  a  spheroidal  shell  is  to  approach  a  spherical  shape.  The 
behavior  of  Neo-Hookean  membranes  with  very  large  meridional  strains  was  studied  by 
Foster  [98]. 


6.  CYLINDRICAL  MEMBRANES 


In  this  section,  some  problems  related  to  pressurized  and  stretched  noncircular 
cylindrical  membranes  are  discussed.  Included  are  cases  where  the  boundary  has  a  non¬ 
circular  (oval)  form,  or  where  processes  cause  an  initially  circular  membrane  to  become 
non-circular  in  its  deformed  state.  The  general  procedure  is  that  of  an  incremental 
analysis,  with  the  pressurized  and  stretched  unbounded  circular  cylindrical  membrane 
taken  as  the  "reference  state"  (section  4a).  This  is  a  point  in  the  solution  domain  for 
all  cylindrical  membranes.  It  has  a  simple  unvarying  geometry  to  which  all  homogeneous 
pressurized  oval  membranes  tend,  far  enough  from  their  ends.  The  main  restriction  is 
that  the  final  deformed  state  should  be  sufficiently  "close"  to  the  reference  state  so 
as  to  make  an  incremental  analysis  (including  linearization  in  the  metric  and  its  first 
order  derivatives)  valid  in  some  approximate  sense. 

Let  x  and  s  denote  longitudinal  (generator)  and  circumferential  (hoop)  arclength 
coordinates  in  the  reference  state.  The  membrane  material  is  assumed  to  be  non  linearly 
homogeneous  and  orthotropic,  such  that  a  circular  cylindrical  membrane  of  length  L  and 

radius  r  is  transformed  by  pressure  p  and  total  tension  T  into  a  circular  cylinder  of 

“xx  T  “ss  — 

length  Lq  and  radius  rQ.  Let  N^oy=Px=27tr"  and  ^(0)=^s=Pr0  he  l^e  physical  Cauchy  stress 
resultants  in  this  reference  state.  Let  n  ,  n  ,  n  ,  e  ,  e  ,  e  ,  K  ,  tc  ,  K  be  the  incre- 

X  $  l  X  S  I  X  S  l 

mental  stress  resultants,  strains  and  curvatures,  all  measured  with  respect  to  the 
Cartesian  metric  (x,s).  Then,  using  (4a),  the  following  field  equations  result: 

<"x  -  Pses  +  Pxex>'x  +  +  2Psc.>'s  -  0 

<"s  "  pxex  +  pses'-s  +  <nt  +  2PxV-x  =  0 

px\  +  psKs +  r  ",  -  <\\  +  Vs  +  2n.Kt)  =  0 
o  - 

(K  “  f-  ex),  -  (K  -  ~  et),x  =  0 
o  o 

(k  -  —  e  ),  -  k.  =0 
s  rQ  s7x  t,s 

1  2 
—  K+e  +e  -2e,  +kk-k,  0 
rQ  x  x,ss  s,xx  t,xs  x  s  t 

e  =  f  n  -  f  n 
x  xx  xs  s 

Es  _  ^sns  ^xsnx 
2et  =  ftnt 


where  the  T  are  constitutive  coefficients  obtained  by  inverting  the  incremental  consti¬ 
tutive  relations  of  the  reference  state  in  terms  of  Cauchy  resultants.  For  quadratic 
energy  densities,  these  are  the  elastic  constants,  in  which  case  the  equations  should  be 
similar  to  those  of  Libai  [30]  and  Danielson  and  Simmonds  [99]). 


Following  section  (4a),  but  without  making  the  approximations  therein,  stress  and 

curvature  functions  are  defined  by  putting  n  =~F,  and  K  =G,  .  Substitution  yields: 

1  XS  l  xs 


n  =  (1  +  a  )F,  +  a  F, 

x  v  x7  ss  xs  xx 


n  =  (1  +  a  )F,  +  a  F, 

s  '  s'  XX  sx  ss 


e  =  f  (1  +  6  )F, 

X  xv  Kx7  ss 


f  (1  +  6  )F, 
xsv  “xs7  XX 


e  =  f  (1  +  6  )F,  -  f  (1  +  8  )F, 

s  sv  rs7  xx  xsv  *sx  ss 


~f.F, 
t  xs 


k  =  G,  +  —  e  ;  ic  =  G,  +  i-(e  -  2f.F,  ) 
s  ss  r  $  x  xx  r^  x  t  xx7 

o  o 

where  the  a-  and  are  constants  having  the  order  of  magnitude  of  the  strains  at  the 

reference  state,  and  are  easily  calculated  [100].  Introduction  into  the  remaining  two 
equations  leads  to  one  4th  order  and  one  2nd  order  equation  in  F  and  G.  These  are  non¬ 
linear  due  to  the  (FG)  and  (GG)  terms  -  as  is  common  in  similar  formulations. 

In  many  practical  applications  all  the  nonlinear  terms  in  the  increments  may  be 
suppressed  (at  least  as  a  first  approximation).  If,  in  addition,  the  strains  at  the 
reference  state  are  small,  the  equations  simplify  to 

—  G,  +  V?F  +  -L[f  F,  +  (2f,  -  f  )F,  ]  =  0 

rQ  ’xx  1  r2l  x  ss  v  t  xs7  xxJ 

o 

PG  +PG  +  —  F  -  —  P  f  F  =0 

rx°’xx  +  su,ss  +  r  r,xx  r  s  xs^’ss  u 
o  o 

where  the  modified  biharmonic  operator  v|  is  defined  by 

Vf  F  =  f  F,  +  f  F,  +  2(f  -  f  )F, 

1  s  xxxx  x  ssss  v  t  xs7  xxss 

Further  elimination  yields  the  6th  order  equation 


r°VI<PxF-xx  +  PSF'SS>  -  Fxxxx  *  <Pxrx  +  Psfxs  -  2P/.>F-xxss  -  PSfXF'SSSS 

In  the  DMV  approximation,  the  right  hand  side  of  the  equation  is  set  to  zero.  For 

quadratic  energy  densities  of  isotropic  membranes,  set:  f  =f.=-rU,  f  =Xr,  f =r:tV. 
4  j  4  x  s  Ci\  xs  c-i  i  c  i 

VlF=EtV  F‘  As  a  practical  application,  consider  the  problem  of  an  oval  cylindrical 

membrane  subjected  to  uniform  pressure  p  and  an  equally  distributed  tensile  force  Px=T/S 

on  its  boundaries  x=0,L.  Diaphragm-type  supports  maintain  the  oval  shape  r.  Small  strain 
theory  is  assumed.  The  reference  surface  is  chosen  to  be  a  circular  cylinder  with  radius 
rQ  and  length  Lq  which  are  obtained  by  applying  the  loads  to  the  isoperimetric  cylinder 

with  radius  ra=S/2rc  and  length  L.  The  approach  consists  of  applying  to  the  incremental 

r  -r  ,  , 

problem  the  boundary  conditions  e  — 5=const.  ;  k  (s -  —  ;  n  =0. 

s  r  s  x  x 


problem  the  boundary  conditions  e  = - =const  ;  K  (s)=-7-T  -  —  ;  n  =  0. 


Of  the  three  conditions,  that  of  e  leads  to  an  edge-effect  correction,  which  is 

s 

strongly  affected  by  P  ,  while  that  of  K  affects  large  parts  of  the  membrane  ("slow 

x  s 


decay"),  which  is  of  more  interest.  A  similar  situation  exists  in  the  semi-membrane 
analysis  of  tubes  [101]  where  the  effects  of  £s  tend  to  be  suppressed. 

Solutions  to  the  case  of  lateral  pressure  (P  =0)  were  studied  by  l.ibai  [24]  who  also 

suppressed  the  boundary  conditions  on  Es,  and  used  a  Fourier  expansion  in  the  s 

direction.  The  solutions  possess  important  features  which  compare  favorably  with  those 
of  the  more  common  linear  membrane  analysis  of  an  oval.  The  latter  has  some  serious 
deficiencies  which  render  it  unacceptable  in  many  applications:  (a)  It  requires  the 
continuity  of  r(s)  and  its  derivatives  (up  to  the  4th  !).  (b)  It  does  not  tend  to  the 
circular  form  far  from  the  edges.  In  fact,  it  diverges  as  L- (c)  It  has  no  decay 
mechanism  for  boundary  disturbances,  (d)  It  cannot  accommodate  boundary  conditions  along 
its  generators  (important  for  cylindrical  panels). 

“l 4  fL  ]4 

The  behavior  of  the  incremental  solution  relates  to  the  parameter  y  =  P  -  ^  -  £s  — 

E  l  r'  '  r  a-1 
a 

(a)  It  yields  smooth  solutions  even  if  the  slope  or  curvature  cf  the  initial  form  have 
discontinuities,  (b)  It  approaches  the  circular  shape  as  y  increases,  (c)  The  typical 
decay  length  0  for  end  disturbances  with  harmonic  index  n  is 
-3/2  -1/4  *3/2  -1/4 

0  =r  n  c,  =Ln  y  .  (d)  It  can  accept  "flexible”  boundary  conditions  along  the 
generators  [30].  (e)  It  approaches  the  linear  membrane  solution  for  y«l,  and  the  ori¬ 
ginal  shape  as  p— >0.  (0  For  large  n,  it  approaches  th'  solutions  of  the  biharmonic 
equation  of  plane  elasticity.  The  method  is  being  applied  to  other  problems  [100] 
including  those  of  cracks  in  cylindrical  membranes. 

An  akin  problem  is  that  of  nonlinear  effects  in  beam-bending  of  tubes  (Brazier 
Problem).  Here,  circumferential  shell  bending  is  commonly  assumed  to  resist  the 
ovalization  which  accompanies  the  beam  bending  of  circular  cylindrical  tubes.  However, 
if  the  tube  is  thin  and  not  very  long,  membrane  resistance  to  ovalization  which  comes 
out  of  the  supports  provides  the  dominant  mechanism.  Pressurization  enhances  the  effect. 


7.  EXAMPLE:  EDGE  EFFECT  IN  A  PARTLY  WRINKLED  AXIMEMBRANE 

Consider  a  truncated  ideal  spherical  membrane  of  radius  R,  connected  to  rigid  rings 
of  radius  rc  along  its  boundaries  a=(Xc,  a=^~ac  (Fig.  2).  If  the  rings  are  pulled  apart 

by  forces  P,  the  spherical  membrane  collapses  inward,  and  a  meridional  tension  field  in 
the  form  of  a  pseudocylinder  is  created.  It  extends  almost  to  the  supports,  but  a  very 
narrow  wrinkled  region  forms  near  the  rings  [47].  The  equations  in  the  unwrinkled  region 

are  those  of  aximembranes  (section  5).  Axial  equilibrium  yields  2nrNrsina=P.  The  other 

equations  can  be  reduced  to  a  second  order  equation  in  y=cota  as  function  of  oc=s/R.  The 

boundary  conditions  at  a.^  is  rc=rc(°r  \q=1).  At  the  boundary  of  the  wrinkled  region 

a=ctg,  the  conditions  are  those  of  continuity:  cig=7i/2  and  Ng=0.  These  can  be  converted 
into  y^y-^O.  Note  that  otg  is  also  unknown,  so  that  three  conditions  are  needed  for  the 
differential  equation  and  otg. 

The  availability  of  two  conditions  at  cig  facilitates  the  solution  of  the  equation  as 
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Figure  2.  Partlv  wrinkled  truncated  spherical  membrane. 

(a)  Original  (b)  Wrinkled  (c)  Wrinkled  with  edge  zone 


an  initial  value  problem  starting  from  cig  by  either  a  power  series  or  numerical 
integration  [58],  The  condition  kg=l  determines  a,  and  consequently  ctg.  In  small  strain 
theory,  the  equations  are: 

(sinay,  ),  -  (csca)y  =  e  1  f - ^ - cosa] 

VfTT'  J 

(sina)y,a  =  v/l+y2  at  a  =  oc  ;  e  = 

Construction  of  a  formal  power  series  solution  yields  the  1st  order  solution: 

°B  -  “c  -  csfo r  +  °<«2)  <«c  *  *«)• 

C 

Near  a  =n/2  this  solution  breaks  down  and  an  additional  term  is  needed,  yielding 

V 

aB  ~  ac  =  v^ve  +  0(£3/2)  (ac  ~  tt/2) 

Large-strain  results  for  a  Neo-Hookean  material  were  obtained  by  numerical  integration. 
These  indicate  the  breakdown  of  the  edge  effect  for  large  strains,  as  expected.  All  the 
other  relevant  parameters  of  the  problem  were  also  calculated  [58]. 

Boundary  layers  0(e)  are  unusual  in  shell  theory.  In  small  strain  theory  they  could 
be  virtually  ignored  if  not  for  the  large  rotations  within  the  layer  which  indicate  the 
possibility  of  strong  interaction  with  bending  effects  in  thin  shell-membranes.  They  are 
not  easily  "captured"  by  pure  numerical  methods,  unless  looked  for. 
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APPENDIX:  SOME  TOPICS  IN  CYLINDRICAL  SHELL-MEMBRANES 
(a)  Nonlinear  Bending  of  Finite  Length  Membrane  Tubes 

A  circular  cylindrical  tube  subjected  to  a  beam  bending  moment  M(x)  deforms 
nonlinearly  due  to  the  "ovalization”  of  its  cross  sections,  until  it  collapses  or 
buckles  locally.  In  very  long  tubes,  ovalization  is  resisted  by  circumferential  shell 
stress-couples ,  and  is  known  as  the  Brazier  effect.  However,  in  Finite-length  lubes, 
membranes  resultants  which  stem  from  the  supports  participate  in  this  resistance  and 
even  dominate  it  for  appropriate  geometries. 

The  membrane  tube  problem  can  be  stated  in  terms  of  a  simplified  mixed  variational 
equation  5U*(F,G)=0,  where: 
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N  =  -  [(F,  +  F)q,  , 

x  L  <M  Msj’s 


N  =  f(F,  +  F)q,  1, 
xs  qq  /M’sj  > 


I  =  TCr  t  ; 


fty2ds: 


cross-sectional  moments  of  inertia. 


Solutions  can  be  obtained  by  expanding  F  and  G  as  truncated  Fourier  series  in  8, 

starting  with  m=2.  The  strongest  nonlinearity  is  due  to  T,  and  in  most  cases  q  may  be 
replaced  by  0  in  and  Nxs-  It  has  been  demonstrated  before  that  the  dominant  Fourier 

component  of  the  ovalization  is  m=2.  Putting 


F  =  r  f(x)sin(29)  ; 


G  =  -  rg(x)cos(20) 


into  the  reduced  functional  and  performing  8U*  =  0,  differential  equations  and  boundary 
conditions  are  obtained: 

(r2/Ext)f,xx  +  (rM/2ExI)2(I/T),g  =  0 


f/Et  -  r  (f/4E  t  -  g/18),  =  0 


(2g  -  9f/Exst),x5f  =  0  ;  g8f,x  =  0 

I  =  1(1  -  2g  -  8/9  g2  +  28/9  g3  +  32/75  g4 . ) 

Traction  boundary  conditions  are  the  specification  of  f  and  f,x(Nx  and  Nxg).  The 

kinematic  conditions  are  the  vanishing  of  their  coefficients:  the  geodesic  (warping)  and 
normal  curvature  changes. 

Equilibrium  paths  (up  to  collapse)  and  local  buckling  curves  for  finite  length  tubes 
subjected  to  a  constant  bending  moment  and  with  diaphragm  and  clamped  support  conditions 
were  presented  by  Libai  and  Bert  in  ICTAM  '92.  Good  correlation  with  previous  numerical 

1/2 

work  on  local  buckling  was  observed  for  membranes  with  (L/r)(t/r)  '  <1.2.  Beyond  this, 
circumferential  shell  bending  effects  became  significant. 

(b)  The  Decay  of  Edge  Disturbances  in  Loaded  Cylindrical  Shells  and  Membranes 

Q 

The  addition  of  a  bending  term  -DV  F  to  the  DMV  form  of  the  cylindrical  membrane 
equation  converts  it  to  a  DMV  cylindrical  shell  equation  which  is  suitable  for  the 
qualitative  analysis  of  edge  effects  and  their  decay  into  the  shell.  To  study  decay  from 
the  curved  edge ,  a  solution  F=Cexp(-x/d)cos(ms/r)  is  put  into  the  equation.  Here,  d  is  a 
characteristic  decay  length  and  m  is  the  harmonic  index.  This  yields  a  characteristic 
equation  in  d: 
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Here,  the  first  term  is  the  nonlinear  membrane  effect  (with  initial  strains  e  =PX/Et, 
es=P s/Et),  the  second  is  the  bending  effect,  and  the  third  is  the  linear  membrane 
effect.  Decay  lengths  are  given  below  for  some  important  special  cases: 


1.1)  Axisymmetric  linear  bending  (m=0,  £x=0): 


d/r=0[(t/r)1/2]. 
1/2, 


(2)  Axisymmetric  nonlinear  membrane  (m=0,  D=0):  d/r=0(£  ). 

A 


(3)  Slow  decay,  linear  bending  (£  =£  =0): 

(4)  Slow  decay,  nonlinear  membrane  (D=0): 

(5)  Slow  decay,  nonlinear  membrane  (D=0,  £,.=0): 

(6)  Plane  sheet  (l/r=0,  m-0): 


d/n=0[(r/t)1/2m"2]. 

d/r=0(£‘1/4m'3/2). 

d/r-0<£xl/2m'1). 

d/t=0(£'1/2). 


Cases  (l)-(2)  typify  rapidly  decaying  edge  disturbances  due,  for  example,  to  radial  and 
rotational  continuity  requirement.  Cases  (3)-(5)  represent  the  slow  decay  of 
circumferentially  varying  edge  input.  As  is  well  known,  linear  membrane  theory  cannot 
handle  the  diffusion  or  decay  of  nonuniform  input.  For  example,  nonuniform  loads  Nx 

applied  to  the  edges,  retain  the  same  nonuniformity  through  the  membrane,  even  for  very 
long  shells,  contrary  to  St.  Venant’s  principle.  Linear  bending  (case  3)  or  nonlinear 
membrane  theories  (case  4)  provide  weak  mechanisms  with  long  decay  lengths.  This 
justifies  the  approximation  F,s»F,x  made  in  these  cases.  Note  that  for  thin,  highly 

stressed  shells,  the  membrane  decay  length  may  be  considerably  shorter  than  the  linear 
bending  length  and  be  the  dominant  effect. 

Case  (6)  represents  the  decay  of  a  bending-membrane  edge  disturbance  from  a  long 

3/2 

stretched  sheet.  As  d— *0  the  ideal  membrane  case  is  approached.  This  occurs  as  (t  ",  E, 
-1/2 

P  )— *0,  and  demonstrates  the  three  main  reasons  for  asymptotic  ideal  membrane 

behavior :  thickness— >0,  rigidity— >0,  loads— >«>. 

The  characteristic  equation  is  also  useful  for  the  evaluation  of  the  relative 

2  2 

importance  of  nonlinear  membrane  effects  (e^,  e  )  vs  bending  effects  (t  /12r")  in 

shells.  In  many  practical  cases,  the  first  may  be  large  compared  with  the  second.  For 
example,  if  Ex=0.01  and  r/t=100,  then  the  nonlinear  membrane  term  would  be  a  thousand 

times  larger  than  the  bending  term.  This  simple  example  should  explain  why  nonlinear 
membrane  theory  is  important. 
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Abstract 

Wave  propagation,  wave  breaking,  and  the  concomitant  wave- induced  momentum 
transport  are  ubiquitous  processes  in  the  Earth’s  atmosphere  and  oceans,  a  classic  ex¬ 
ample  being  the  surface  gravity  waves  from  storms  in  the  Southern  Ocean  that  drive 
mean  longshore  currents  on  California  beaches.  Such  processes  are  not  only  interesting 
in  themselves,  but  are  also  fundamental  to  making  sense  of  the  various  'wave- turbulence 
jigsaw  puzzles’  with  which  the  atmosphere  and  ocean,  separately  and  in  combination, 
confront  us.  For  instance,  what  used  to  be  regarded  as  an  enigmatic  'negative  viscos¬ 
ity’  of  the  subtropical  atmosphere  is  now  straightforwardly  comprehensible  in  terms 
of  Rossby  wave  propagation  and  breaking.  Other  examples  include  understanding  (r/) 
why  the  mean  east -west  winds  in  the  equatorial  lower  stratosphere  reverse  every  14  1$ 
months,  throughout  a  belt  encircling  the  globe,  ( b )  why  the  lowest  temperatures  on 
earth  (as  low  as  110 K,  or  — 163°C)  are  found  not  in  the  winter  hemisphere  but  near 
the  summer  pole,  at  altitudes  between  80  and  90km,  (c)  why  the  o folding  atmospheric 
lifetimes  of  certain  man-made  chlorofluorocarbons  are  of  the  order  of  a  century,  and  (<7) 
why  the  greatest  concentrations  of  stratospheric  ozone  are  found  where  photochemical 
ozone  production  rates  are  least. 

This  lecture  will  discuss  some  of  the  theoretical-mechanical  concepts  relevant  to  un¬ 
derstanding  these  phenomena,  including  the  concepts  of  wave  'momentum'  and  wave 
‘breaking’.  There  emerge,  somewhat  unexpectedly,  what  might  prove  to  be  some  useful 
new  ideas  about  the  problem  of  water-wave  generation  by  wind.  The  main  point  is 
that  the  water  waves  can  be  systematically  amplified  by  certain  irreversible,  ratchet- 
like,  non-superposable  effects  that  depend  on  spatio-temporal  inhomogeneities,  such  as 
wind  gustiness  and  wave  'groupiness’.  These  include  the  effects  of  what  might  be  called 
‘Rossby  lee  waves’  in  the  airflow  downstream  of  water-wave  groups.  The  resulting  wave 
drag  can  amplify  non-breaking  water  waves  and  might,  for  instance,  help  to  explain  the 
growth  of  the  ‘energy  front’  reported  by  0.  M.  Phillips  in  this  Proceedings. 

1.  INTRODUCTION 

I  want  to  widen  the  context  of  this  Minisymposium  and  talk  about  some  phenomena 
and  concepts  that  appear  fundamental  to  a  whole  range  of  problems  in  atmosphere 
ocean  dynamics.  I  also  want  to  say  something  about  that  old  but  still  problematic 
topic,  the  generation  of  water  waves  by  wind.  I  cannot  claim  to  be  an  expert  on  that 
problem,  let  alone  on  air-sea  interaction  in  general,  but  it  is  possible  that  a  fresh  look 
from  another  perspective  might  help  to  advance  our  understanding. 
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The  atmosphere  and  oceans  used  to  he  thought  of  simply  as  'turbulent’  fluids  on  a 
vast  scale,  and  early  attempts  to  understand  them  often  involved  Reynolds  averaging 
together  with  the  hope  that  the  resulting  eddy- dux  terms  might  be  able  to  be  charac¬ 
terized,  at  least  roughly,  in  terms  of  the  notion  of  ‘eddy  diffusivityh  It  wins  therefore  a 
surprise  when  even  the  signs  of  these  fluxes  sometimes  turned  out  contrary  to  expec¬ 
tation.  This  was  found,  for  example,  from  global-scale  atmospheric  ‘general  circulation 
statistics’  (e.g.  Lorenz  1967);  and  for  a  time  the  phenomenon  was  thought  of  as  a  kind  of 
mysterious  ‘negative  eddy  viscosity’  (Starr  1968).  This  meant  of  course  that  no-one  un¬ 
derstood  what  was  going  on.  Eastward  momentum  was  seen  to  be  transported  poleward 
by  large-scale  eddies,  against  the  local  mean  horizontal  shear  in  subtropical  latitudes. 
The  eddy  viscosity  became  infinite  a  little  further  poleward.  Many  other  such  exam¬ 
ples  are  now  known,  one  of  the  most  conspicuous  being  the  celebrated  ‘quasi-biennial 
oscillation’  (QBO)  of  the  equatorial  lower  stratosphere,  to  be  described  in  §4. 

The  mystery  was  gradually  solved  as  it  was  recognized  how  important  for  this  pur¬ 
pose,  as  well  as  ubiquitous,  are  the  various  wave  propagation  mechanisms  that  operate 
in  the  atmosphere  and  oceans,  such  as  internal  gravity  wave  and  Rossby  wave*  propaga¬ 
tion.  Among  the  important  pioneering  contributions  were  those  of  Charney  and  Drazin 
(1961),  Eady  (unpublished,  but  see  Green  1970),  Booker  and  Bretherton  ( 1967).  Wallace 
and  Holton  (1968),  Lindzen  and  Holton  (1968),  Dickinson  (1969),  and  Rhines  (1975). 
Today  we  have  a  relatively  clear  view  of  these  problems,  both  through  data  from  clever 
terrestrial  and  space-based  observing  techniques,  and  through  a  better  understanding 
of  the  basic  theoretical  principles  and  of  how  to  applv  both  numerical  and  idealized 
theoretical- mechanical  modelling.  Such  modelling  is  used  not  only  in  hypothesis- testing 
thought-experiments,  but  also  as  pointing  toward  better  ways  to  make  observational 
data  tell  us,  in  a  dynamically  intelligible  way,  more  about  what  is  going  on  in  the  real 
atmosphere  and  oceans  (e.g.  Thorncroft  et  a 1.  1993,  h  refs.).  Better  understanding 
includes  seeing  what  is  robust  about  an  idealized  model,  hence  which  aspects  of  it  are 
likely  to  carry  over  to  more  realistic  situations.  This  lecture  will  mention  a  few  such 
models  and  their  contribution  to  our  present-day  understanding.  Also  touched  on  will 
be  the  intimate  relation  between  wave  propagation  mechanisms  and  shear  instability 
mechanisms,  and  the  concept  of  ‘wave  breaking’  and  its  frequent  relevance  —  when 
appropriately  defined  —  to  phenomena  involving  wave-induced  momentum  transport. 

It  is  interesting  to  view  the  wind-wave  problem  from  the  conceptual  vantage  point 
thus  arrived  at.  There  emerge  what  may  turn  out  to  be  some  new  ideas  about  wave 
generation  mechanisms,  to  be  discussed  briefly  at  the  end  of  die  lecture.  Besides  being  of 
general  significance  for  our  understanding  of  ocean  waves  and  air-sea  interaction,  these 
ideas  might  help  to  explain,  for  instance,  the  growth  of  the  ‘short  wave  energy  front" 
seen  in  tbe  experiments  described  in  Professor  Phillips’  Minisymposium  Lecture  in  this 
Proceedings  and  in  Chu  et  ai.  (1992).  The  key  is  to  recognize  all  the  wave  propagation 
and  wave  breaking  mechanisms  that  come  into  play,  in  tbe  air  as  well  as  in  the  water. 


2.  THE  MIDDLE  ATMOSPHERE:  SOME  OBSERVED  FACTS 

The  phenomena  to  be  discussed,  including  the  phenomenon  of  ‘negative  viscosity", 
are  seen  very  clearly  in  what  is  now  usually  called  the  ‘middle’  atmosphere  (but  still, 
occasionally,  the  ‘upper’  atmosphere).  It  comprises  the  stratosphere,  extending  to  about 
50  km  altitude,  and  the  mesosphere  above  it,  extending  to  somewhere  between  80  and 


‘Rossby  waves  (historically,  ‘Kelvin -KirchhofF-Rayleigh  Rossby  waves’)  may  also  be  called  'vort icily 
waves’  or,  more  generally,  ‘potential- vorticity  waves’  (§6  below).  1  am  following  today’s  established 
usage. 


100km.  Figure  1  includes  the  hulk  of  the  middle  atmosphere:  ir  shows  a  typical  mean 
January  temperature  distribution  as  a  function  of  altitude  and  latitude.  The  units  on  the 
right-hand  scale  are  e-folding  pressure  scale  heights,  roughly  7  km.  s<>  that  the  altitude 
range  is  from  sea  level  to  roughly  85  km.  (Somewhere  above  this  altitude,  temperatures 
rise  steeply  into  what  is  called  the  thermosphere,  where  simple  thermodynamics  and 
continuum  mechanics  break  down  and  plasma  physics  becomes  important.)  This  sec¬ 
tion  and  §§4.8  sketch  briefly  some  of  what  is  known  and  understood  about  the  middle 
atmosphere;  a  more  extensive  discussion  and  bibliography  can  he  found  in  a  recent  re¬ 
view  of  mine  (1992)  written  for  a  non- specialist  audience  of  physicists,  and  in  other, 
more  specialized  review  material  cited  therein;  see  for  instance  the  book  bv  Andrews. 
Holton  and  Leovy  (1987). 


Figure  1  Temperatures  7  between  sea  level  and  about  85  km.  for  typical  January  conditions  (degrees 
Kelvin,  longitude  and  time  averaged)  Northward  is  toward  the  right,  and  the  right-hand  altitude  scale 
is  in  e-folding  pressure  scale  heights,  roughly  7  km.  Frotu  satellite  and  other  observations  analyzed  by 
Barnett,  and  Corney  (1985). 

The  light  shading  in  Figure  1  shows  the  warmest  regions,  and  the  dark  shading  the 
coldest,  in  January.  Some  of  these  features  can  be  understood  largely  from  considera¬ 
tions  of  radiative  heat  transport.  For  instance  the  high  temperatures  T>  260  K  near  7 
scale  heights  or  50  km,  defining  the  ‘stratopause'.  are  largely  due  to  heating  by  absorp¬ 
tion  of  solar  ultraviolet  radiation  by  ozone,  balanced  by  infrared  cooling  to  space.  T 
increases  southward  at  the  stratopause  mainly  because  the  diurnally  averaged  normal 
solar  irradiance  has  an  absolute  maximum  at  the  south  pole  at  the  December  solstice, 
and  throughout  a  substantial  part  of  December  and  January.  The  south  pole  is  then, 
so  to  speak,  the  sunniest  place  on  earth.* 

A  number  of  other  features  are  not  so  simply  explained.  One  example  is  the  very- 
cold  region  at  the  top  left  of  the  picture,  where  the  lowest  mean  temperature  plotted  is 
T  =  150  K.  In  fact  individual  rocket,  soundings  have  shown  temperatures  as  low  as  110  K 

*For  solar  declination  n  relative  to  a  spherical,  rotating  earth,  the  fractional  length  of  day  A(c>)  at 
latitude  6  is  7r~!  arccos[rnax { -  I  ,rniri(  1 .  -  tan  o  tan  d>)}].  and  the  diurnally  averaged  vortical  component 
of  solar  irradiance  is  the  full  solar  irradiance  multiplied  by  A(<J) sin o sin <f>+  7T-i  sin{7rA((A)}  eosn  oos<j. 
This  has  an  absolute  maximum  at  the  south  pole  when  nr  is  within  2. 8° of  its  minimum  value  ~23.fi0. 
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on  occasion.  At  these  altitudes,  the  sunniest  place  on  earth  is  also  the  coldest  place 
on  earth.  Until  about  a  decade  ago.  this  was  regarded  as  one  of  the  great  enigmas  of 
atmospheric  science,  the  enigma  of  the  ‘cold  summer  mesopause’.  There  is  no  doubt  that 
the  observed  temperatures  are  far  below  what  they  should  be  on  the  basis  of  radiative 
heat  transport  and  photochemistry  alone.  Temperatures  substantially  higher  than  their 
radiative  values  also  occur.  They  occur,  for  instance,  throughout  most  of  the  depth  of 
the  north-polar  region  on  the  right  of  Figure  1. 

Let  us  note  one  other  observed  fact,  at  first  sight  having  little  to  do  with  the  observed 
pattern  of  temperature  anomalies,  but  actually  closely  connected.  This  is  the  now- 
notorious  fact  ( with  its  potentially  serious  implications  for  the  stratospheric  ozone  layer) 
that  the  man-made  chlorofluorocarbons  known  as  CFC-11  and  CFC-12  have  very  long 
atmospheric  lifetimes,  of  the  order  of  a  century.  These  are  c-folding  times.  Even  if  all 
leakage  of  these  CFCs  into  the  atmosphere  could  be  stopped  tomorrow,  it  would  take 
several  centuries  for  their  concentrations  to  decrease  to,  say,  1%  of  present  values. 

How  are  these  facts  connected?  The  connecting  link  is  indicated  in  Figure  2.  which 
shows  an  estimate  of  the  mean  circulation  of  the  stratosphere  and  mesosphere.  It 
is  the  mean  in  a  sense  very  roughly  equivalent  to  a  Lagrangian  mean  with  suitable 
re-initialization  of  particle  ensembles  (see  the  caveats  in  my  1992  review  and  its  bib¬ 
liography).  This  mean  circulation  gives  us  a  roughly  correct  explanation  both  of  the 
temperature  anomalies  and  of  the  observed  lifetimes  of  CFCs.  and  the  lifetimes  of  cer¬ 
tain  other  long-lived  chemical  tracers.  The  temperature  anomalies  are  accounted  for  by 
adiabatic  expansion  in  the  rising  branches  of  the  circulation,  and  adiabatic  compres¬ 
sion  in  .ho  descending  branches,  pulling  temperatures  away  from  the  radiative  values 
toward  which  they  would  otherwise  tend  to  relax.  The  rising  branches  carry  chemical 
tracers  upward,  for  example  CFCs  and  other  tropospheric  tracers  through  the  tropical 
stratosphere.  The  rate  at  which  they  are  carried  upward  governs  the  rate  at  which  they 
are  exposed  to  sufficiently  energetic  solar  ultraviolet  radiation,  and  hence  destroyed 
photochemically.  This  is  the  main  way  in  which  CFCs  are  destroyed,  removal  rates  at 
sea  level  being  relatively  small.  The  typical  strength  of  the  mean  circulation  required 
to  account  for  the  observed  temperature  anomalies  also  gives  a  CFC  destruction  rate 
consistent  with  the  observed  lifetimes. 

Again,  the  rising  branch  of  the  circulation  at  higher  altitudes,  shown  schematically 
by  the  heavy  dashed  curve  at  the  top  left  of  Figure  2,  explains  the  extraordinarily  low 
summer  mesopause  temperatures.  The  circulation  also  carries  small  amounts  of  wa¬ 
ter  vapour  upward  to  the  mesopause.  A  phenomenon  observed  at  these  altitudes,  the 
sporadic  formation  of  ‘noctilucent’  and  ‘polar  mesospheric’  clouds,  requires  a  supply  of 
water  vapour  as  well  as  exceptionally  low  temperatures.*  Water  vapour  is  photochemi¬ 
cally  destroyed  near  the  mesopause,  where  ultraviolet  photons  are  even  more  energetic 
than  in  the  stratosphere.  Considerations  like  these,  and  other  observational  evidence, 
give  confidence  in  the  picture  suggested  by  Figure  2. 

The  mean  circulation  is  also  part  of  what  controls  the  distribution  of  ozone,  and  the 
rate  of  replenishment  of  the  ozone  layer.  The  photochemistry  of  ozone  is  complicated  and 
can  interact  more  subtly  with  the  fluid  dynamics  than,  for  instance,  CFC  destruction, 
which  depends  mainly  on  total  exposure  to  ultraviolet  photons.  Ozone  can  be  expected 
to  be  more  sensitive  to  processes  hidden  by  the  averaging  in  Figure  2;  we  return  briefly 
to  this  point  in  §8. 


*See  for  instance  Thomas  et  a/.  (1989,  &  refs.).  These  incidentally  are  the  world’s  highest  clouds,  by 
far,  occurring  at  altitudes  around  83-85  krn.  Anyone  who  lives  between  about  50°  and  60°  latitude  can 
observe  them  as  an  electric  blue  glow  above  the  poleward  horizon,  sometimes  intricately  patterned,  on 
a  few  clear  nights  after  midnight  in  the  two  months  following  midsummer. 
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Figure  2.  Light  curves:  mass  transport  streamlines  of  the  longitude  and  time  averaged  global  scale 
circulation  in  an  altitude  range  between  about  3  and  9  pressure  scale  heights  (cf.  right-hand  scale 
in  Figure  1),  estimated  from  satellite  data  for  January  1979  by  Solomon  et  aJ.  (198fi).  The  pressure 
altitude  is  in  nominal  kilometres,  defined  as  1/7  of  a  scale  height.  The  circulation  is  defined  in  a  quasi- 
Lagrangian  sense  giving  a  simplified,  but  roughly  correct,  indication  of  the  vertical  advective  transport 
of  chemical  tracers  (and  is  broadly  consistent  with  direct  observations  of  such  tracers).  The  time  for 
a  notional  fluid  element  to  rise  from  the  tropical  tropopause  to,  say.  40  km  is  typically  of  the  order  of 
two  years.  Heavy  dashed  curve:  qualitative  indication  of  the  extension  of  the  circulation  into  t  he  upper 
mesosphere  and  lower  thermosphere. 

How  is  the  mean  circulation  driven?  This  again  forces  us  to  consider  processes  hidden 
by  the  averaging.  The  key  point  is  that  the  Earth  is  a  rapidly  rotating  planet.  It  is 
rapidly  rotating  in  the  sense  that  the  distribution  of  angular  momentum  M  per  unit  mass 
in  the  atmosphere  is  dominated  by  the  Earth’s  rotation,  and  only  weakly  affected  by  air 
motion  relative  to  the  Earth.  In  the  extratropics  there  is  a  strong  latitudinal  gradient 
of  M.  So  a  circulation  like  that  shown  in  Figure  2  cannot  persist  unless  something 
exerts  a  persistent  torqiie  on  the  extratropical  atmosphere,  in  an  appropriate  sense  (in 
fact,  against  the  Earth’s  rotation  everywhere  except  in  the  summer  mesosphere,  where 
it  must  be  with  the  rotation).  In  a  thought-experiment,  in  which  this  torque  is  switched 
off,  the  circulation  begins  to  die  down,  and  temperatures  to  relax  toward  their  radiative 
values  fe.g.  Haynes  et  al.  1991,  &  refs.). 

It  is  now  believed  on  good  evidence  that  in  the  real  middle  atmosphere  this  torque 
arises  from  wave-induced  angular  momentum  transport,  in  fact  mostly  from  Rossby 
waves  and  internal  gravity  waves  generated  in  the  troposphere.  The  Rossby  waves 
account  for  most  of  the  lower  part  of  circulation  shown  in  Figure  2,  and  the  gravity 
waves  account  for  the  upper  mesospheric  branch.  The  sense  of  the  angular  momentum 
transport  is  related  to  the  phase  speeds  of  the  waves  and  to  the  different  places  where 
they  are  generated  and  dissipated,  and  not  locally  to  such  things  as  the  sign  of  the 
mean  shear.  It  is  no  wonder,  then,  that  attempts  to  apply  ideas  like  ‘eddy  viscosity' 
can  produce  incongruous  results,  such  as  negative  or  infinite  eddy  viscosities. 
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3.  WAVE-INDUCED  MOMENTUM  TRANSPORT 

The  fact  that  wave  propagation  and  diffr;  tion  are  generally  accompanied  by  a  sys¬ 
tematic  flux  or  transport  of  momentum  is  a  well-known  rule  in  theoretical  physics, 
applying  to  waves  in  fluids  as  well  as  to  the  more  obvious  case  of  waves  in  a  vacuum, 
such  as  photons.  The  key  phenomenon  is  that,  if  progressive  waves  are  generated  in  one 
place  A  and  dissipated  in  another  B,  this  is  accompanied  by  an  irreversible,  cumulative 
transport  of  momentum  from  A  to  B.  The  sense  and  rate  of  this  transport  is  usually 
given,  to  some  useful  approximation,  by  what  might  be  called  the  photon  analogy,  or 
‘quasimomentum  rule’  (§5). 

The  physical  reality  of  such  wave-induced  momentum  transport  is  easy  to  demon¬ 
strate  in  the  laboratory.  For  instance,  it  manifests  itself  in  the  phenomenon  known  as 
the  ‘sonic  wind’  or  ‘quartz  wind'  (e.g.  Lighthill  1978a, b),  in  which  a  piezoelectric  trans¬ 
ducer  emits  a  beam  of  ultrasound  that,  transports  momentum  from  the  transducer  to 
locations  where  the  waves  dissipate.  The  resulting  mean  force  on  the  fluid  often  gener¬ 
ates  a  turbulent  jet.  One  can  demonstrate  what  is  fundamentally  the  same  tiling  without 
any  special  apparatus  at  all,  using  capillary-gravity  waves  at  frequencies  of  order  5  Hz. 
This  was  done  during  the  lecture  using  a  small  cylindrical  wavemaker  (Figure  3a)  and 
a  glass  dish  containing  water  with  chalk  dust,  on  the  surface.  Making  the  wavemaker 
oscillate  vertically  shows  that  the  observed  mean  flow  (arrowed  curves)  is  predominantly 
wave-induced,  and  not  Rayleigh-  Schlichting  boundary-layer  streaming  from  the  surface 
of  the  wavemaker.  The  latter  has  the  opposite  sense  (e.g.  Lighthill  1978a,  p.348:  Van 
Dyke  1982,  Fig.  31).  Carefully  stopping  the  wavemaker  and  observing  the  persistence 
of  the  mean  flow  shows  that  it  is  more  than  a  ’Stokes  drift’:  irreversible,  cumulative 
momentum  transport  has  indeed  taken  place.  With  a  larger  area  of  water,  one  can  use 
a  curved  wavemaker  to  focus  the  waves  on  a  more  distant  spot  (Figure  3b).  illustrating 
the  fact  that  the  wave  generation  and  dissipation  sites  can  be  well  separated  spatially. 


Figure  3.  Simple  demonstrations  of  momentum  transport  by  5  Hz  surface  capillary-gravity  waves  (see 
text).  Configuration  (a)  works  well  with  a  wavemaker  whose  diameter  ~  3  to  1  cm.  and  configuration 
(b)  with  a  wavemaker  whose  radius  of  curvature  >  50cm.  From  McIntyre  and  Norton  (1990). 

This  last  point  is  underlined  by  the  classic  work  of  Munk  rt  a i.  (1963)  in  which, 
following  earlier  work  by  Barber  and  Ursell,  they  demonstrated  that  surface  gravity 
waves  generated  by  storms  in  the  Southern  Ocean  can,  and  often  do,  propagate  across 
the  Pacific  all  the  way  to  beaches  in  California,  where  they  break  and  generate  longshore 
mean  currents  (see  also  Snodgrass  ct  a/.  1966).  This  is  a  clear  case  of  irreversible  wave- 
induced  momentum  transport  over  many  thousands  of  kilometres!  The  internal  gravity 
waves  that  sustain  the  mesospheric  circulation  of  Figure  2  also  dissipate  mostly  In 
breaking.  They  have  sources  mostly  in  the  denser  layers  of  the  atmosphere  far  below, 
so  that  horizontal  momentum  is  transported  over  vertical  distances  not  of  thousands, 
but  certainly  many  tens,  of  kilometres  (e.g.  Fritts  1984,  1993). 

Essentially  the  same  things  happen  with  Ross  by  waves,  except  that  Rosshy -wave 
dynamics  has  a  peculiar  ‘one-signedness’  that  can  constrain  the  sense  of  the  momentum 


transport  in  a  ratchet-like  way.  As  well  as  being  interesting  for  this  reason,  the  case  of 
Rossby  waves  is  arguably  the  simplest  to  understand  from  first  principles.  This  will  be 
seen  in  §7.  First,  however.  I  want  to  mention  an  example  involving  internal  gravity  and 
other  kinds  of  waves,  in  which  the  wave-induced  momentum  transport  interacts  with 
wave  refraction  to  produce  an  interesting  feedback  oscillation  of  the  mean  tiox  that 
has  been  observed  both  in  the  laboratory  and  in  the  real  atmosphere  the  celebrated 
‘quasi-biennial  oscillation"  or  ‘QBO'  and  then  I  want  to  touch  briefly  on  the  photon 
analogy  and  on  what  is  sometimes  called  wave  ‘momentum'. 

4.  THE  QBO  AND  ITS  LABORATORY  ANALOGUE 

The  ‘laboratory  QBO"  was  demonstrated  in  a  famous  experiment  by  Plumb  and 
McEwan  (1973).  The  system  used  was  a  salt -stratified  fluid  contained  in  a  large  labo¬ 
ratory  annulus,  of  depth  50  cm  and  gap  width  12  cm.  Internal  gravity  waves  are  excited 
bv  making  a  flexible  lower  boundary  oscillate  in  a  standing  wave,  equivalent  to  equal 
amplitudes  of  clockwise  and  anticlockwise  progressive  waves.  The  response  of  the  fluid 
breaks  this  symmetry  (the  annulus  is  at  rest  relative  to  tne  laboratory,  and  the  Earth’s 
rotation  can  be  neglected):  and  a  wave- induced  menu  flow  arises,  horizontally  around 
the  annulus.  Soon  a  regime  is  established  displaying  a  characteristic  spacetime  signa¬ 
ture,  in  which  the  mean  flow  reverses  periodically  at  a  given  altitude,  and  does  so  earlier 
at  higher  altitudes.  This  was  illustrated  by  a  movie  of  the  original  experiment  shown 
in  the  lecture. 

The  two  mechanisms  involved  are  first  the  wave-indued  angular  momentum  trails 
port,  cumulatively  changing  the  mean  velocity  profile  as  the  waves  dissipate  (viscously 
in  this  case),  and  second  the  ('fleet  of  mean  shear  in  Dopplet  shifting  and  refracting 
the  waves  (somewhat  like  the  selective  surface-wave  refraction  that  can  make  a  batht  ub 
vortex  appear,  at  first  glance,  to  be  rotating  the  wrong  way),  in  the  Plumb  McEwan 
experiment  the  wave  dissipation  rate  is  least,  and  the  vertical  group  velocity  greatest, 
when  the  waves  propagate  against  the  mean  flow.  Such  waves  therefore  penetrate  high¬ 
est.  They  transport  angular  momentum  in  the  same  sense  as  their  intrinsic  angular 
phase  speed  and  can  therefore  reverse  the  mean  motion  where  they  dissipate,  first  at 
higher  and  then  at  lower  and  lower  altitudes  in  the  annulus.  Mathematical  models  that 
express  these  ideas  tend  to  behave  non -chaotically,  and  to  produce  the  spacetime  signa¬ 
ture  very  robustly  (e.g.  Yoden  and  Holton  1988:  Haynes  rt  a/.  1993).  as  long  as  waves 
of  sufficient  amplitude  are  excited  in  both,  senses,  clockwise  and  anticlockwise. 

The  same  spacetime  signatuie  (albeit  not  exactly  periodic)  is  conspicuously  present  in 
the  mean  east  west  winds  of  the  tropical  lower  stratosphere,  throughout  a  belt  encircling 
the  glob*',  at  altitudes  between  about  1G  and  35km  and  latitudes  between  about  ±15°. 
Whereas  in  the  laboratory  it  typically  takes  about  half  an  hour  for  the  mean  flow  to 
reverse,  in  the  real  stratosphere  it  takes  about  14  Hr  3  months.  The  wind  reversals  are 
clearly  seen  in  the  tropical  radiosonde  data  that  became  routinely  available  from  the 
early  1950s  onwards:  and  there  is  indirect  evidence  for  their  existence  many  decades 
before  then  (Hamilton  and  Garcia  1984;  Teitelbaum  rt  a  1.  1993).  Although  our  detailed 
understanding  is  incomplete,  it  seems  overwhelmingly  likely  that  the  resemblance  to 
the  laboratory  experiment  and  to  mathematical  mode's  of  it  is  not  accidental,  and 
that  the  mean  flow  changes  arc  wave  dri\<  n  in  much  the  same  way.  Indeed,  before 
wave  driving  was  thought,  of  at  all  in  tins  context  the  observed  QBO  used  to  be 

another  great  enigma  of  atmospheric  science  a  strong  case  had  already  been  made 
for  the  existence  of  some  strange  kind  of  eddy  or  fluctuation-induced  angular  momentum 
transport,  again  entailing  ‘negative  eddy  viscosity’.  Without  some  such  eddy  transport, 
it  seemed  impossible  to  make  dynamical  sense  of  the  observed  mean  flow  changes,  on 
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the  basis  of  careful  physical  arguments  and  numerical  experiments  (Wallace  and  Holton 
196S).  It  was  only  after  this  that  the  realization  came  (Liudze,.  and  Holton  19CS)  that 
wave-induced  angular  momentum  transport  could  behave  in  just  the  required  manner. 

We  do  not  yret  have  a  convincing  quantitative  model  of  the  real  QBO  including,  for 
instance,  the  effect  '  c  the  mean  upwelling  illustrated  in  Figure  2.  This  adverts  angu¬ 
lar  momentum  upward  on  a  comparable  timescale,  and  we  lack  sufficient  quantitative 
knowledge  of  its  strength  and  seasonal  variation.  There  is  also  uncertainty  over  exactly 
which  wave  types  are  significant  in  the  tropical  stratosphere.  It  used  to  be  assumed  that 
the  principal  such  wave  types  are  the  equatorially  trapped  Kelvin  and  Rossby  gravity- 
waves  (e.g.  Andrew's  et  al.  1987.  Gill  19S2).  There  is  observational  support  for  this 
in  the  case  of  the  Kelvin  wave,  whose  observed  amplitude  is  not  far  from  having  an 
appropriate  order  of  magnitude.  But  it  has  seemed  more  and  more  likely,  for  instance, 
that  planetary-scale  Rosshv  waves  originating  in  the  extratropical  troposphere  are  more 
significant  than  equatorial  Rossby  gravity  waves  meaning  locally  significant  in  the 
tropics  (e.g.  Dickinson  1968:  Lindzen  and  Tsay  1975:  Andrews  and  McIntyre  1970: 
Dunkerton  1983:  Takahashi  and  Boville  1992:  O'Sullivan  and  Hitchman  1992.  set'  §8 
below)  as  well  as  indirectly  significant  through  the  strength  of  the  mean  upwelling. 

We  have  even  less  knowledge,  either  observational  or  theoretical,  of  possible  wave 
generation  mechanisms.  The  real  QBO  seems  to  involve  highly  complicated,  chaotic, 
nonlinear  wave  ger.  ration  processes  (mainly  in  the  troposphere,  both  tropical  and  ex- 
tratropi  al)  to  which  there  is  a  robustly  noil-chaotic  response  in  the  tropical  stratosphere 
-  with,  it  might  be  added,  potentially  important  consequences  for  long-range  weather 
forecasting,  such  as  a  feedback  or  the  depth  of  cumulonimbus  convection  and  hence 
on  tropical  cyclone  intensities  and  El  Nino  timings  (Gray  et  al.  1992).  These  aspects 
remain  a  challenge  arid  opportunity  for  the  future. 

5.  LONG-DISTANCE  TRANSPORT  AND  THE  PHOTON  ANALOGY 

The  ‘photon  analogy’  or  'quasimomentum  rule"  says  that  the  rate  at  which  momentum 
is  transported  from  location  A  to  location  B.  when  a  wave  packet  is  generated  a*  A  and 
dissipated  at  B.  is  the  same  as  if 

(«)  the  fluid  were  absent,  and 

(6)  the  wave  packet  haa  a  certain  amount,  of  momentum  q  that  it 
carries  around  with  it,  like  a  photon  in  a  vacuum. 

The  quantity  q  is  not  actually'  a  momentum.  It  is  a  property  of  the  wave  packet  that  may- 
more  aptly  be  called  its  quasimomentum  or  pseudomomentum,  in  order  to  distinguish  it 
from  momentum.  Momentum,  in  the  presence  of  a  material  medium,  is  a  fundamentally 
different  quantity.*  Hence  the  alternative  term  ‘quasimomentum  rule'.  Indeed  a  better, 
more  precise  statement  is  that  “the  rate...  is  the  same  as  if 
<a)  the  fluid  were  absent,  and 

( h )  the  wave  packet  had  momentum  equal  to  its  quasimoment um.” 

The  quasimomentum  is  a  wave  property  in  the  sense  that  it  can  be  evaluated  from  lin¬ 
earized  wave  theory  alone.  In  the  simplest  theoretical  approximations  (slow  modulation 


‘This  is  because  its  conservation  corresponds  to  a  different  translational  symmetry  (of  tl.  propagating 
medium  rather  than  of  the  total  physics).  Then  is  no  widely  agreed  name  for  q.  Other  names  used 
with  different  kinds  of  waves  include  'wave- vector",  ‘Poynting's  moment  urn-.  Minkowski  momentum', 
‘radiation  momentum',  "acoustic  momentum’,  ‘crystal  momentum',  ‘phonon  momentum',  ‘tensor  mo¬ 
mentum',  ‘field  momentum’,  ‘canonical  momentum',  'wave  moment  um'.  and  'momentum'.  This  has 
led  to  some  confusion.  For  the  basics  plus  some  history  going  hack  to  the  time  of  Havleigh.  f’oyntiiig. 
Abraham,  arid  Minkowski,  and  for  keys  to  the  literature,  t  he  read  r  mav  consult  my  1081  essay  ami  its 
bibliography,  also  Loudon  and  Paige  ( 1901 ,  p. 2116),  Peierls  (1001  §§2.4  (i),  and  Shepherd  (1000.  §5), 


as  well  as  small  amplitude),  we  may  take  q  >-  /  k  when-  /-.  i>  ih<-  mtnusic  wave  enemy 
in  the  sense  <liscussed  e.g.  hy  Bret  lici  t*  m  and  (Janet t  i  lOtiS).  k  is  the  wavenumber  vet- 
tor,  and  T  is  tin-  intrinsic  frequency,  or  frequency  Doppler  shifted  to  a  reference  frame 
moving  with  the  local  mean  flow,  lake  K .  q  is  Our)  in  wave  amplitude  a. 

The  analogy  summarizes  a  hotly  of  special  and  general  results  from  theories  in  which 
a  is  considered  small  and  in  which  wave  induced  momentum  transport,  and  all  the  as 
sociated  mean  ('fleets,  tire  self- consistently  described  correct  to  Our  As  is  well  known 
the  theoretical  calculations  can  be  elaborate  and  tricky,  equatorial  Rossby  gravity  waves 
being  a  cast'  in  point  (Andrews  and  McIntyre  1978a  tj9.  iV  refs.).  Such  theories  tire  often 
qualitatively  applicable,  and  may  also  be  quantitatively  applicable  to  important  parts 
of  the  problem,  such  as  the  part  concerned  with  wave  induced  momentum  transport  he 
tween  the  sites  of  wave  generation  and  dissipation.  Amplitudes  in  the  intervening  wave 
field  can  in  some*  eases  be  truly  small  tin  the  relevant  sense,  measured  by  wave  slopes), 
as  for  instance  in  the  case  of  surface  gravity  waves  crossing  the  Pacific.  Some  aspects 
of  the  theory  and  the  photon  analogy  extend  to  finite  amplitude  as  well  (Andrews  and 
McIntyre  1978a. b).  Tin-  analogy  lias  relevance  to  all  the  situations  and  all  the  wave 
types  mentioned  in  this  lecture,  including  the  equatorial  Kelvin  and  Rossby  gravity 
waves  that  are  thought  to  contribute  to  the  real  QBO. 

But  wait.  I  hear  someone  say.  why  all  this  hair-splitting  about  an  'analogy'  why 
the  'as  if  and  the  ‘quasi-,  to  say  nothing  of  the  ‘pseudo”.'  Isn't  this  complicating 
things  unnecessarily?  Surely  wave  packets  in  a  fluid  really  do  have  momentum  q.  which 
they  really  do  carry  around  with  them,  just  like  photons  in  a  vacuum.  Is  it  not  well 
known  that  dissipating  waves  exchange  ‘their  momentum'  f meaning  q)  with  the  mean 
flow?  How  could  they  do  such  a  tiling  if  they  didn’t  really  have  momentum  q  to 
exchange?  Besides,  how  else  could  those  waves  that  propagate  across  the  Pacific,  after 
generation  by  storms  in  the  far  south,  drive  longshore  currents  on  northern  beaches? 
In  that  situation  we  do  not  have  a  steady  wavemaker.  and  we  do  not  have  a  steady 
■radiation  stress'  spanning  the  whole  ocean  for  weeks  on  end.  So  surely  wave  packets, 
and  finite  wavetrains.  must  just  carry  momentum  with  them.  And  what  about  the 
mesospheric  circulation  and  the  uoctilucenf  clouds?  Real  internal  gravity  waves  are 
highly  intermittent:  and  isolated  wave  packets  are  again,  arguably,  a  more  relevant 
idealization  than  steady  waves  spanning  the  whole  depth  of  the  middle  atmosphere. 

Well,  what  really  happens  is  interesting,  and  worth  a  brief  digression.  Take  for  in¬ 
stance  an  idealized  version  of  the  situation  in  mid-Pacifle.  Figure  4.  from  my  19S1  essay, 
shows  an  isolated,  non-dissipating,  two-dimensional  packet  of  surface  gravity  waves  on 
deep  water.  More  precisely,  it  shows  semi-schematically  the  leading-order  theoretical 
solution  describing  the  disturbance  and  its  accompanying  ()(a2 )  velocity  field,  derivable 
from  the  work  of  Longuet-Higgins  and  Stewart  (1902).  That  solution  ran  be  used  to 
compute  the  total  momentum  of  the  propagating  disturbance.  In  order  for  the  photon 
idea  to  he  literally  true,  in  the  manner  just  envisaged,  it  would  he  necessary  for  that 
momentum  to  be  well  defined  and  equal  to  q.  It  turns  out  that  in  this  case  the  momen¬ 
tum  is  indeed  well  defined.  But  a  careful  computation  shows  v  liat  Longuet-Higgins  and 
Stewart  also  found,  namely  that  the  momentum  is  not  equal  to  q  (see  figure  caption). 
To  leading  order,  it  is  zero! 

This  is  not  a  paradox.  Rather,  it  is  one  of  many  counterexamples  showing  that,  foi 
waves  in  material  media,  the  photon  idea  cannot  be  taken  literally  as  a  general  principle. 
In  order  for  the  photon  idea  to  make  sense  in  general  in  fact  to  make  sense  outside 
a  very  limited  set  of  circumstances  it  is  indeed  crucial  to  regard  it  as  an  analogy, 
i.e.,  to  retain  the  words  ‘as  if  preceding  items  (a)  and  (/#)  above,  and  to  continue  to 
recognize  the  distinction  between  momentum  and  quasinionieiitnin.  Thus  stated,  the 
analogy  is  both  useful,  and  capable  of  general  theoretical  justification.  One  approach  is 
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to  use  ‘generalized  Lagrangian  means’  in  conjunction  with  Kelvin's  circulation  theorem 
(Andrews  and  McIntyre  1978a);  the  importance  of  the  circulation  theorem  in  this  kind  of 
problem  was  recognized  by  Rayleigh  (1896),  and  its  connection  with  the  photon  analogy 
was,  I  think,  first  recognized  by  Bretherton  (1971).  Additional  considerations  that  have 
improved  our  general  understanding,  but  have  yet  to  be  fully  worked  out,  can  he  found 
in  my  paper  with  Norton  (1990).  But.  to  return  to  the  idealized  Pacific’  problem,  how 
then  does  the  whole  thing  work  fluid-rnechanically? 

The  main  point  is  this.  As  a  wave  packet  propagates  past  any  given  fluid  element 
between  its  generation  and  dissipation  sites,  it  gives  rise  to  an  ()(a~)  mean  forcing 
whose  time  integral  is  zero  for  that  fluid  element.  Details  arc  complicated  but  the  most 
significant  aspect  of  this  forcing  can  be  thought  of,  for  present  purposes,  as  corning  from 
the  divergence  of  a  radiation  stress  spanning  the  region  occupied  by  the-  wave  packet 
(and  satisfying  Newton’s  third  law  of  ‘action  and  reaction  ).  This  causes  the  given  fluid 
element  first  to  feel  a  mean  push,  and  then  a  mean  pull,  against  other  fluid  elements  in 
the  region.  Consequently,  the  time-integrated  force  on  the  given  fluid  element  is  zero. 
Corresponding  statements  are  true  of  the  other  aspects  of  the  0{o2 )  mean  forcing,  such 
as  apparent  mass  sources  and  sinks  (e.g.  Andrew’s  and  McIntyre  1978a). 

Computing  the  0(a2)  response  to  a  given  0( n1 )  mean  forcing  is  a  linear  problem. 
Therefore,  during  a  time  interval  in  which  one  or  more  wave  packets  propagate  from  A 
to  B,  one  can  regard  the  <9(«2)  mean  forcing  as  the  sum  of  two  contributions:  first  a 
steady  forcing,  corresponding  to  a  steady  wavetrain  and  its  radiation  stress  spanning  the 
entire  region  between  A  and  B  —  and  conforming  to  the  photon  analogy  and  second 
an  oscillatory  forcing  in  the  same  region  w’hose  time  integral  vanishes  everywhere. 

It  is  only  the  first  of  these  two  contributions  that  is  interesting  from  the  present 
viewpoint,  i.e.  that  corresponds  to  the  notion  of  a  cumulative,  irreversible  wave- induced 
transport  of  momentum  from  A  to  B.  The  second,  oscillatory  contribution  evokes  a 
response  that  is  non-cumulative,  because  of  the  vanishing  of  its  time  integral.  It  tends 
moreover  to  be  strongly  dependent  on  circumstances  such  as  conditions  at  remote 
boundaries,  and  how  the  waves  were  generated.  Its  details  can  ho  complicated.  For 
instance  if  one  were  to  include  stable  stratification  and  Coriolis  effects  in  a  less  idealized 
model  of  the  Pacific  ocean,  then  the  0(a2)  response  to  the  passage  of  a  non-dissipating 
wave  packet  would  be  quite  different  from  that  shown  in  Figure  4.  It  would  involve  the 
excitation  of  very  weak  0(a2)  internal  Corioiis-gravity  (inertio  gravity)  waves  over  a 
large  area  of  ocean.  In  fact  something  similar  happens  even  in  the  special  case  of  Fig¬ 
ure  4  (see  my  1981  essay  for  further  discussion)  since  in  general  there  are  very  distant, 
very  weak,  fast-propagating,  ultra-long  0(a2)  surface  gravity  waves,  which  I  have  not 
attempted  to  depict  in  the  figure  but  which  embody  significant  amounts  of  momentum, 
and  which  depend  on  how  the  wave  packet  was  generated. 

In  summary,  then,  what  is  complicated,  and  circumstance-dependent,  is  the  detailed, 
unsteady  0(cr )  mean  response  of  the  fluid  medium  to  the  generation,  propagation  and 
dissipation  of  a  wave  packet,  or  of  many  wave  packets.  What  is  simple,  and  general, 
is  the  fact  that  the  time  integral  of  this  0(n2 )  response  is  zero  apart  from  the  cu¬ 
mulative  contribution  given  by  the  photon  analogy.  This  is  true,  in  a  wide  range  of 
problems  of  this  kind,  whether  or  not  any  well-determined,  well-localized  0{o2 )  mean 
momentum  appears  temporarily  within  the  fluid  as  wave  packets  propagate  past,  or 
whether  for  instance  0(a 2)  mean  momentum  is  temporarily  taken  up  by  distant  bound¬ 
aries,  as  is  sometimes  signalled  by  divergent  momentum  integrals  in  idealized  versions 
of  the  problem  —  or  whether  0(n2)  mean  momentum  is  taken  up  by  distant  long- wave 
disturbances,  as  in  the  problem  of  Figure  4  and  its  variants. 

Another  thought-experiment  of  fundamental  interest  in  this  connection  is  to  scatter 
the  wave  packet  of  Figure  4  from  an  immersed  obstacle,  and  ask  what,  the  mean  recoil 


force  is.  One  finds  that  it  is  give  n  by  the  photon  analogy,  appropriately  re  stated.  1  his 
again  is  ‘despite’  the  fact  that,  in  this  case,  the  wave  packet  has  a  well  dctined  iiionunt  uni 
equal  to  zero.  \\  hat  happens  is  that  more  very  weak,  last  propagating,  ultra  long  wave 
0(aM )  disturbances.  of  the  kind  ;drea<ly  referred  to.  are  radiated  during  the  reflection 
process.  One  can  derive  a  general  result,  comprising  a  non  trivial  extension  of  Nuethei  s 
theorem,  that  shows  why  the  net  effect  of  all  these  O'ur  )  phenomena  must  be-  given  by 
the  photon  analogy  in  many  such  cases  including  this  one.  The  result  also  shows  why 
there  are  some  'exceptional  problems  for  which  the  analogy  fails.* 


Figure  1.  Packet  of  surface  gravity  waves  propagating  toward  the  right  in  deep  water  and  its  acrompa 
■lying  G(a2)  velocity  field  plotted  quantitatively  except  that  rite  Stokes  drift  (near  the  surface)  is  not 
depicted.  The  total  momentum  of  the  wave  packet  is  well  defined,  and  comprises  the  momentum  of  the 
Stokes  drift,  which,  for  tlie.se  particular  waves,  equals  q.  plus  the  momentum  of  the  return  flow  under¬ 
neath,  which  equals  — q  (because  return-flow  acceleration  reactions  feel  the  free  surface  as  elf'ectively 
rigid  for  further  discussion  see  McIntyre 

6.  ROSSBY  WAVES,  VORTICES  AND  SHEAR  INSTABILITIES 

Of  all  the  examples  of  irreversible  wave-induced  momentum  transport,  some  of  those 
associated  with  Rosshy  waves  are  arguably  the  simplest  as  well  as  among  the  most 
important.  Rossby  waves  and  related  phenomena  are  ubiquitous  in  the  atmosphere  and 
oceans,  and  are  fundamental  to  almost  every  aspect  of  large-scale  atmosphere  ocean 
dynamics.  For  example,  it  is  Rossby  waves  and  related  phenomena  that  drive  most  of  the 
mean  circulation  illustrated  in  the  lower  part  of  Figure  2.  The  'Rosshy  wave  elasticity' 
to  which  the  wave  propagation  owes  its  existence  is  important  also,  for  instance,  in 
strongly  inhibiting  the  turbulent  transport  of  chemicals  into  the  Antarctic  ozone  hole, 
or  of  drier  air  into  the  moist  eye  wall  of  a  tropical  cyclone.  A  still  wider  interpretation 
of  ’related  phenomena',  meaning  phenomena  depending  on  'Rosshy  elasticity',  would 

[  he*  key  step  is  to  consider  a  certain  translational  symmetry  operation  giving  a  couserval  ion  law  for 
the  sum  of  the  quasimoment  urn  of  the  waves  and  the  moment um  of  the  immersed  obstacle  There-  are 
exceptional  oases  because  this  translational  symmetry  operation  is  singular.  I  he  singularity  is  strong 
enough,  in  some  cases,  to  invalidate'  the  extension  of  Noether's  theorem.  Details  will  be  given  m  a 
forthcoming  paper  with  S.  1)  Mobbs  (  l!t‘):t)  I  discovered  one  of  these  exceptions  by  chance-  some-  tune 
age),  via  some  very  care-fill  0(ei“)  calculations  (McIntyre-  1972.  197:1).  The  existence-  e»f  siie-h  e-xe-e-pl  ie.us 
is  a  tolling  confirmatieen  that  the-  pheiton  analogy  is  only  an  anale>g\.  highly  use-tul  whe  n  valid  but  not. 
in  general,  to  be-  take-ri  life-rally. 
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include  many  important  types  of  shear  instabilities  all  the  way  from  ordinary  small 
scale  shear  instabilities  to  the  large-scale,  buoyancy-powered  ‘baroclinic  instabilities' 
that  can  lead  to  the  formation  of  common  types  of  atmospheric  and  oceanic  eddies  and 
vortices,  including  extratropical  weather  cyclones  (Hoskins  ef  a 1.  1985.  <!c  refs.). 

Rossby  waves  and  related  phenomena  occur  in  dynamical  systems  of  the  generic  form 

DQjDt  —  0,  u  =  I(<^)  ,  (C.la.b) 

where  D/Dt  =  dt  +  u  •  V  —  dt  +  udt  +  rt)v.  the  two-dimensional  material  derivative, 
and  where  I(-)  is  a  time- independent  functional  of  the  materially  conserved  scalar  field 
Q.  The  simplest  case  is  the  familiar  case  of  two-dimensional  invisrid.  incompressible 
vortex  dynamics,  for  which 

1(0)  =  (-5,,  ^)V-'2Qr  (G.2) 

where  QT  —  Q  —  vz  —  uy,  the  ordinary  vorticity  and.  to  make  the  inverse  Laplacian 
unambiguous,  suitable  boundary  conditions  are  understood  such  as  evanescence  of  juj 
at  infinity.  Note  that  (6.2)  implies  V  •  u  =  uz  +  vy  —  0.  The  next  simplest  case  is  the 
same  thing  on  a  rotating  earth  whose  vertical  component  of  absolute  vorticity  is  Q,,. 
say,  a  prescribed  function  of  horizontal  position  but  not  of  time.  Then,  retaining  the 
notation  u  =  ( u ,  v )  for  the  relative  velocity  and  Qr  —  vr  —  u„  for  die  relative  vorticity. 
we  may  take  Q  =  Qe  +  QT  in  (6.1)  and  retain  (6.2)  unchanged,  remembering  that  is 
prescribed.  The  notation  in  (6.1)  has  been  chosen  to  emphasize  the  fact  that  the  single 
scalar  field  Q  contains  all  the  dynamical  information.  At  ever}’  instant,  the  Q  field  ran 
be  ‘inverted'  to  recover  the  velocity  field  u.  One  may  call  I(  )  the  ‘inversion  functional'. 

In  more  realistic  models  of  the  rotating,  stratification-constrained,  vortical  flows  that 
occur  in  the  real  atmosphere  and  oceans,  the  same  generic  mathematical  structure  (6.1 ) 
applies,  in  many  '-ases  to  remarkable  accuracy.  This  is  why  simple  two-dimensional 
vortex  dynamics  has  always  been  such  an  important  idealization  in  the  context  of 
atmosphere-ocean  dynamics.  The  coordinates  r  and  y  now  measure  horizontally- 
projected  distances  along  the  (approximately  horizontal)  stratification  surfaces.  D/Dt 
and  a  —  (u,i>,0)  are  still  two-dimensional  on  each  such  surface,  and  uT  4-  ev  is  still 
‘zero’  to  some  useful  approximation  (more  precisely,  has  typical  magnitudes  <C  typi¬ 
cal  magnitudes  of  vx  —  uy).  What  is  new  is  that  the  inversion  functional  If-)  is  now 
three-dimensional.  Away  from  the  equator,  it  still  has  the  qualitative  character  of  (6.2) 
but  with  V"2  more  like  a  three-dimensional  inverse  Laplacian.  in  coordinates  verti¬ 
cally  stretched  by  Prandtl’s  ratio,  the  ratio  of  the  buoyancy  to  Coriolis  frequencies. 
Distortions  of  the  stratification  surfaces  are  also  determined  as  part  of  the  inversion 
operation.  One  may  generally  characterize  such  stratified,  rotating  flows  as  approxi¬ 
mately  ‘layerwise  two-dimensional'.  In  the  most  accurate  models,  which  include  models 
whose  validity  extends  into  the  tropics.  Q  is  the  Rossby -Ert  cl  potential  vorticity.  and 
inversion  is  no  longer  a  linear  operation  (e.g.  Hoskins  et  nl.  1985,  <L'  refs:  Thorpe  1985: 
Davis  1992;  Raymond  1992). 

One  has  here,  incidentally,  a  framework  for  the  general  characterization  of  coher¬ 
ent  structures  such  as  vortices  and  vortex  pairs,  in  atmosphere  ocean  dynamics 
amounting  to  variations  on  a  theme  from  Professor  Roshko’s  Opening  Lecture  in  this 
Proceedings.  For  instance  a  ‘vortex’,  in  any  dynamical  system  of  the  form  (6.1).  is 
the  coherent  structure  represented  by  a  strong,  isolated  anomaly  in  the  Q  field  together 
with  the  induced  velocity  and  any  other  relevant  fields  in  and  around  it.  where  ‘induced’ 
means  given  by  whatever  inversion  operator  I(-)  characterizes  that  dynamical  system. 
Atmospheric  cyclones  and  anticyclones,  and  oceanic  Gulf  Stream  rings  and  ‘Meddies- 
(e.g.  Armi  et  a i.  1988),  are  all  cases  in  point.  Figure  5  illustrates  one  such  structure  in  a 
model  atmosphere,  somewhat  idealized  but  instantly  recognizable  to  any  meteorologist 


familiar  with  large-scale  atmospheric  behaviour.  It  is  a  cyclonic  (earth  eo-rot atingj  ex 
tratropical  vortex  induced  by  a  strong,  compact  anomaly  in  the  Rossby  Erte!  potential 
vorticity  field  near  the  tropopau.se.  Coherent  structures  such  as  these  are  often  impor 
tant  for  weather  developments,  in  which  fast  advection  of  potential- vorticity  anomalies 
near  the  tropopause  is  an  important  aspect  of  the  dynamics,  approximately  satisfying; 
eq.  (6.1a.)  over  timescales  of  several  days. 


potential-vorticity  anomaly  (stippled  region)  near  a  model  tropopause  (heavy  curve)  across  which  the 
Rossby  -Ertel  potential  vorticity  has  a  strong  discontinuity,  by  a  factor  of  6.  The  family  of  thin  curves 
some  of  which  are  closed  are  isopleths  of  tangential  velocity,  at  .'5  ms"1  intervals  the  greatest  velocities 
>  21  ms-1  being  at  the  tropopause  —  and  the  other  family  of  thin  curves,  more  nearly  horizontal,  are 
the  stratification  surfaces.  These  are  isopleths  of  potential  temperature  0  defined  to  coincide  with 
actual  temperature  at  pressure  1000  mb  or  1000  h Pa,  plotted  at  5  K  intervals.  ‘Anomaly’  means  a 
potential-vorticity  contrast  on  a  0  surface  (Hoskins  ct  a.1.  1985,  eq.29).  The  induced  surface-pressure 
minimum  is  41  hPa  below  ambient.  The  structure  is  typical  for  middle  latitudes;  the  Coriolis  parameter 
is  10-4  s-1  (as  at  latitude  43.3°N).  The  domain  show'll  h  as  a  radius  of  2500km.  Prom  the  work  of  Thorpe 
(1985). 

Now  to  ‘Rossby-wave  elasticity’.  The  simplest  example  occurs  in  the  strictly  two- 
dimensional  dynamical  system  specified  by  (6.2)  with  Qr  =  Q  —  Q<,  =  vT  -  uy%  and  Q,. 
a  linear  function  of  y  so  that  when  u  =  0  we  have 

dQ/dy  —  dQe/dy  —  ft  —  constant  .  (6.3) 

This  is  Ross  by  \s  famous  Vi-plane’  or  ‘nearly  flat  earth'  model,  with  (x.y)  taken  to  be 
Cartesian.  Then  (6.1)  is  satisfied  in  this  case  without  linearization,  as  it  happens 

. .  by  expressions  of  the  form  u  =  {-dy,  dx)v(x,yj )  (Qr  -  V2c’)  with  <p(x,y,  *)  * 

cos  ly  cos  { k(x  —  ct)},  provided  that  the  phase  speed  c  and  the  wavenumber  components 
k ,  l  satisfy  the  dispersion  relation 

c  =  ~ft/(k2  -M2)  .  (6.4) 

The  one-signedness  previously  referred  to  shows  up  here;  (6.4)  is  a  formula  for  c,  and 
not  c2  as  in  classical  small-vibration  problems  in  non- rotating  reference  frames.  We  can 
see  the  reasons  for  the  one-signedness  and  appreciate  its  robustness  as  follows. 

By  (6.1),  the  contours  of  constant  Q  are  also  material  contours.  If  a  disturbance 
makes  these  contours  undulate  as  suggested  in  Figure  6,  then  Qr  in  (6.2)  will  be  al¬ 
ternately  positive  and  negative  as  indicated  by  the  plus  and  minus  signs  in  Figure  6. 
Then  since  ip  =  V~2Qr  the  contours  of  ip  can  be  pictured  as  the  equipotentials  of  the 
electrostatic  field  due  to  a  pattern  of  alternating  positive  and  negative  charges  (with  the 


sign  changed),  or  as  the  topographical  contours  giving  the  displacement  of  a  stretched 
elastic  membrane  that  is  pulled  up  (  — )  and  pushed  down  (  +  )  alternately  in  the  same 
pattern.  Hence  y  will  have  hills  and  valleys  centred  respectively  on  the  minus  and  the 
plus  signs,  implying  that  the  strongest  north-south  velocities  (at  right  angles  to  the 
electric  field  in  the  electrostatic  analogy)  will  occur  at  intermediate  positions,  a  quarter 
wavelength  out  of  phase  with  the  displacement,  and  in  the  sense  shown  by  the  heavy, 
dashed  arrows  in  Figure  6.  If  one  now  makes  a  moving  picture  in  one's  mind's  eye  of 
what  this  induced  velocity  field  will  do  to  the  material  contours,  one  can  see  at  once 
that  the  behaviour  must  be  oscillatory,  and  also  such  that  the  undulations  propagate 
from  right  to  left  only. 

The  same  physical  picture  exhibiting  one-way  propagation  applies  when  Q(.  --  0  but 
Q  has  a  background  gradient  dQo  / dy  due  to  a  mean  shear  flow  u  — 

dQo/dy  =  - d2u0/dy 2  =  ~wUl/v  (0.5) 

so  that  Q  —  Qo{y)  +  Q'{x,y,t)  and  y  =  —  jua(y)dy  4-  i''(x,  y.  f)  and.  for  small 
disturbances, 

( dt  +  uodz)Q'  —  UQyyii'1  x  -  0  ,  i p1  =  V  iQl  ,  (G.G) 

essentially  the  Rayleigh  equation.  It  is  no  surprise,  therefore,  to  find  that  the  simplest 
classical  shear  instabilities,  such  as  the  instability  of  the  «<>  =  tanh  y  shear  layer,  or 
its  Rayleigh  (piecewise  linear)  counterpart,  can  be  understood  (consistently  with  the 
Rayleigh,  Fjprtoft  and  Arnol’d  stability  theorems)  in  terms  of  a  coupled  pair  of  Ross  by 
waves,  in  a  certain  frame  of  reference,  each  of  which  propagates  against  the  local  mean 
flow  and  phase-locks  with  the  other  in  such  a  way  as  to  bring  it  to  rest.  Furthermore, 
each  Rossby  wave  makes  the  other  grow  exponentially,  via  a  reduced  phase  shift  between 
disturbance  velocities  v1  and  the  sideways  displacements  r)1  of  the  Q  contours  (Lighthill 
1963,  Bretherton  1966,  Hoskins  et  a i.  1985  §6b).  The  Miles  (1957)  wind -wave  instability 
is  in  some  ways  fundamentally  similar,  except  that,  one  of  the  Rossby  waves  is  replaced 
by  the  surface  gravity  wave,  and  there  is  a  mismatch  between  the  strengths  of  the  gravity 
and  Rossby  elasticities  leading  among  other  things  to  relatively  slow  growth. 

7.  ROSSBY-WAVE  BREAKING:  A  DEFINITIVE  EXAMPLE 

In  Figure  6  the  Q  (material)  contours  are  depicted  as  undulating  reversibly,  the 
situation  described  by  linearized,  dissipationless  wave  theory.  Indeed,  for  linearized 
theory  to  be  self-consistentiv  and  generally  applicable,  along  with  associated  concepts 
such  as  the  principle  of  superposition,  the  undulations  must  also  be  gentle.  Strictly 
speaking,  the  sideways  slopes  (dy/dx)Q  must  be  infinitesimal.  The  opposite  extreme, 
that  of  infinite  sideways  slopes  followed  by  sideways  overturning,  and  rapid,  irreversible 
deformation,  lengthening  and  folding  of  the  Q  contours,  is  a  commonplace  occurrence 
and  can  be  recognized  as  a  Rossby-wave  version  of  ‘wave  breaking'.  An  idealized  example 
is  shown  in  Figure  7;  see  also  Figure  9  below.  This  phenomenon,  in  a  variety  of  forms, 
is  ubiquitous  in  the  real  atmosphere  and  oceans  and  plays  an  important  role  in  the 
irreversible  transport  of  momentum  and  angular  momentum  by  Rossbv  waves. 

The  case  of  Figure  7,  known  as  the  Stewartson  Warn  Warn  (SWW)  Rossby-wave 
critical-layer  solution,  is  now  described  in  more  detail.  It  is  important  out  of  all  propor¬ 
tion  to  the  restrictive  idealizations  used  because  it  is  an  unequivocal  example,  described 
by  an  analytical  solution,  of  irreversible  wave-induced  angular  momentum  transport  due 
solely  to  wave  breaking,  with  no  other  wave  dissipation  mechanisms  involved. 

We  return  to  the  dynamical  system  described  by  (6.1)-  (G. 3),  but  now  introduce  an 
undisturbed  flow  iio(y)  having  constant  shear  uny,  say.  Again  the  Q  contours  are  made  to 
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Figure  6.  Sketch  of  the  Q  contours  and  the  Q-atiomaiy  (Q1)  pattern,  and  the  induced  velocity  field 
giving  rise  to  the  sideways  ‘Rossby  elasticity’  in  a  simple,  non-breaking  Rossby  wave 

undulate,  this  time  by  introducing  a  gently  undulating  boundary  near  some  value  yu  of  y, 
exciting  Rossby  waves  of  constant  phase  speed  c  and  y-lengthscale  UQy/ 3.  The  boundary 
displacement  amplitude  is  of  order  e  u0yj /?  where  e  is  a  small  dimensionless  parameter. 
The  x-wavelength  2n/k  of  the  undulation  is  assumed  long  enough,  2>  2ir  u0y/f3,  for 
the  inverse  Laplacian  V"2  in  (6.2)  to  be  approximated  by  (d/dy)~2,  simplifying  the 
mathematics.  (The  lengthscale  uoy//3  then  corresponds  to  the  lengthscale  in  (6.4) 
when  c  ~  l~1u0y.)  Under  these  restrictions  and  with  suitable  choices  of  yo,  this  problem 
can  be  solved  analytically  (Stewartson  1978,  Warn  and  Warn  1978).  The  Q  contours 
behave  in  an  approximately  undular  manner  except  in  a  narrow  region  surrounding  the 
critical  line,  or  y-location  where  ito(y)  =  c.  This  narrow  region,  the  ‘critical  layer’,  is 
separated  from  the  undulating  boundary  by  a  distance  >  uoy/d  and  has  width  of  order 


b  =  e1/2  u0y/l3  . 


(7.1) 


The  ‘inner  problem’  for  this  region  is  mathematically  the  same  as  the  nonlinear  pen¬ 
dulum  problem,  incompressible  fluid  flow  replacing  incompressible  phase-space  flow.  It 
is  analytically  soluble  in  terms  of  elliptic  functions,  and  the  solution  confirms  that  the 
region  is,  indeed,  a  region  of  Rossby  wave  breaking  in  the  sense  envisaged.  Figure  7 
shows  the  solution  at  four  successive  times.  Q  contours  are  overturning  sideways,  and 
deforming  in  a  manifestly  irreversible  way. 

Let  us  suppose  that  the  resulting  rearrangement  of  the  Q  field  has,  in  a  coarse- 
grain  view,  something  of  the  character  of  a  mixing  layer  as  shown  in  idealized  form  in 
Figure  8a.  Then  it  is  a  trivial  matter  to  see  that  there  must  be  an  associated  irreversible 
transport  of  momentum.  Let  6Q(y )  be  the  change  in  Q  represented  by  the  difference 
between  the  solid  and  dashed  lines  in  Figure  8a,  and  bu(y)  the  corresponding  change  in 
u.  Application  of  the  inversion  operator  (6.2)  gives  (since  6Q  —  6Q r  =  —  6uy  here)  the 
parabolic  profile 


Su(y)  ~  -  j  t>Q(y)dy  =  |  q 


2P{y2-\bJ) 


(- 

(y  <  \ bm  or  y  >  \bm) 

where  bm  is  the  breadth  of  the  mixing  layer.  Negative  momentum 


2 

\b, 


<  V  <  2  bm  ) 
1  1 


6M  ex  J  6u(y)dy  =  —  —  flb, 


(7.2) 


(7.3) 


has  been  transported  irreversibly  into  the  region  where  Q  has  been  rearranged.  This 
phenomenon  is  robust:  any  change  SQ{y)  qualitatively  like  that  in  Figure  8a  will  be 
associated  with  a  momentum  change  of  the  order  of  magnitude,  and  sign,  indicated  by 
(7.3).  For  instance  the  SWW  solution  has  a  limiting  value  of  6M  as  t  — *  oc,  which 
can  be  expressed  in  the  form  (7.3)  with  the  value  of  bm  shown  by  the  bar  at  the  centre 
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of  Figure  7d.  This  value  of  bm  may  be  thought  of  as  an  'effective  mixing  width'  for 
the  SWW  solution.  In  this  ease  the  momentum  (7.3)  has  eomo  from  the  wave  source 
comprising  the  undulating  boundary  at  large  }'.  When  ;i,  or  more  generally  the  initial 
gradient  d{Qe  +Qo)/dy ,  is  positive,  this  momentum  is  necessarily  negative  another 
manifestation  of  the  one-signedness  of  Rossby  dynamics. 

The  SWW  solution  provides  us,  incidentally,  with  a  definitive  counterexample  to 
arguments  saying  that  (^-mixing  scenarios  like  that  of  Figure  8a  are  impossible  because 
they  violate  momentum  conservation.  These  arguments  overlook  the  possibility  of  wave 
induced  momentum  transport  from  outside  the  region.  Further  historical  remarks  and 
references,  going  back  to  various  issues  surrounding,  for  instance,  G.I.  Taylor's  ‘vorticity 
transfer  theory’,  and  its  recent  developments  including  links  with  the  photon  analogy, 
can  be  found  in  my  1992  review  and  in  the  paper  with  Norton  (1990). 

Another  interesting  point  about  Figure  7  is  that  the  predicted  Q  configuration  be¬ 
comes  shear-unstable  after  the  contours  first  overturn.  So  if  any  noise  is  present  initially, 
the  actual  evolution  is  quite  different  in  detail.  In  typical  cases  the  result  is  an  appar¬ 
ently  chaotic  form  of  Rossby  wave  breaking,  and  an  increase  in  the  effective  mixing 
width  bm  by  a  modest  factor  such  that  SM  (oc  bm  3 )  increases  in  magnitude  by  a  fac¬ 
tor  2  or  3.  Figures  8b, c,  from  the  definitive  study  by  Haynes  (1989),  show  the  Q  and 
6u  profiles,  defined  as  Eulerian  .r-averages,  in  one  such  case.  The  6u  profile  has  the 
approximately  parabolic  form  suggested  by  the  idealization  (7.2). 

The  foregoing  examples  are  conceptually  important  in  another  way,  already  hinted 
at.  The  inviscid,  two-dimensional  fluid-dynamical  system  under  consideration  is  known 
to  have  mathematically  regular  behaviour,  over  arbitrarily  long  time  intervals.  The 
examples  are  therefore  cases  of  wave  dissipation  and  irreversible  momentum  transport 
that  do  not  depend  on  overtly  dissipative  processes  like  viscosity.  The  irreversibility 
involved  is  a  purely  fluid-dynamical  irreversibility,  precisely  that  associated  with  the 
persistent  lengthening  of  the  Q  contours  as  time  goes  on,  and  familiar  from  other  fluid- 
dynamical  paradigms  such  as  ‘random  straining’  and  ‘turbulence’  (e.g  Batchelor  1952). 

8.  THE  DEFINING  PROPERTY  OF  WAVE  BREAKING,  THE  STRATO¬ 
SPHERIC  ‘SURF  ZONE’,  AND  OZONE  CHEMISTRY 

What  should  one  mean  by  wave  breaking  for  general,  non-acoustic*  waves  in  fluids? 
Even  in  the  most  familiar  case,  ordinary  surface  gravity  waves,  the  phenomenon  usually 
recognized  as  breaking  has  an  extensive  ‘zoology’  of  shapes  and  time-evolutions.  The 
same  is  true  of  internal  gravity  and  Rossby  waves.  The  question  does  not  seem  to  have 
any  natural  answer  from  a  ‘zoological’  or  morphological  viewpoint.  However,  a  natural 
answer  does  suggest  itself  if  one  wants  the  concept  of  ‘wave  breaking"  to  be  relevant  to 
the  general  question  of  wave-induced  momentum  transport,  or,  more  precisely,  to  the 
question  of  when  wave-induced  momentum  transport  becomes  irreversible. 

One  can  then  use  the  rapid,  irreversible  material-contour  deformation  illustrated 
above  as  the  defining  property  of  wave  breaking.  ‘Rapid'  means  that  deformation  rates 
are  comparable,  at  least,  to  the  local  intrinsic  wave  frequency.  Such  a  definition  is  en¬ 
tirely  compatible  with  the  accepted  phenomenology,  and  relevance,  of  wave  breaking 
in  the  case  of  surface  gravity  waves  and  longshore  ocean-beach  currents.  The  case  for 
such  a  generalization  is  carefully  argued  in  three  papers  with  T.  N.  Palmer  (19S3  5). 
It  avoids  ‘zoological’  definitions,  and  requirements  to  decide  whether  ‘turbulence’  is  in¬ 
volved,  but  does  take  account  of  the  relevant  general  theorems,  particularly  Kelvin’s 


‘Shock  formation  in  acoustic  waves  seems  best  kept,  conceptually  separate,  if  only  because  its  essential 
dependence  on  the  existence  of  overtly  dissipative  processes  (microscopic  irreversibility). 


Figure  7.  Contours  of  constant  Q  for  the  SWW  solution,  a  special  hut  clear-cut  example  of  Hossby- 
wave  breaking  (contrast  the  purely  undular.  non  breaking  Hossby  wave  in  Figure  (i),  and  the  consequent 
irreversible  wave-induced  momentum  transport.  See  also  Figure!).  The  //-scale  has  been  expanded  using 
the  re-scaled  coordinate  V'  =  y/b  with  b  =  r1/3  U[\yfJ\  the  range  -5  <  V'  <  5  is  plotted.  Four  successive 
stages  in  the  evolution  are  shown,  at  times  1.  1.5,  2,  3  in  units  of  21  ^~(k  b  u<iy )~  1 ,  where  k  is  the  x- 
wavenumber.  The  vertical  bar  in  panel  (d)  gives  the  effective  mixing  scale  f>m  ~  b\  c  r.  (7.2).  (7.5)  and 
Figure  8a.  Front  Killworth  and  McIntyre  (1985),  after  Stewartson  (1978)  and  Warn  and  Warn  (1978), 


(a)  y  (b|  V  (c)  V 


Figure  8.  Q-mixing  scenarios  and  associated  momentum  change;  see  (7.2),  (7.3):  (a)  the  simplistic, 
but,  qualitatively  relevant,  idealization  that,  assumes  perfect  Q- mixing  over  width  bm,  (b)  an  actual 
Eulerian-rnean  Q(y)  profile  from  an  accurate,  quasi-ehaot  ic  Kossby-wave  critical- layer  solution  ( l\  11 
Haynes,  personal  communication);  and  (e)  the  approximately  parabolic  8«( y)  profile  eorr<'spon<litig  to 
(b),  showing  the  momentum  change  due  to  the  Hossby  wave  breaking  (first  of  (7.2)  and  of  (7.3)).  As 
in  Figure  7,  the  re-sealed  coordinate  V  —  y/b  is  used,  over  the  same  range  —5  to  5  (b  =.»'/■*  uny/.d). 

circulation  theorem  and  the  way  it  manifests  itself  in  exact,  formally  complete  theo¬ 
ries  of  wave  mean  interaction.  The  relevant  material  contours  are  defined  to  be  those 
that  would  otherwise  undulate  reversibly  under  the  generalized  elasticity,  or  restoring 
mechanism,  that  gives  rise  to  the  wave  propagation. 

Figure  9,  of  which  an  animated  version  was  seen  in  the  lecture,  shows  a  less  idealized 
example  of  Rossby  wave  breaking  and  its  effects,  taken  from  the  work  of  Norton  ( 1993). 
The  parameter  conditions  are  far  closer  to  those  in  the  real  stratosphere  than  those 
assumed  by  critical-layer  theory.  Here  the  dynamical  system  comprises  the  shallow 
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water  equations  on  a  sphere,  solved  numerically  by  a  high- resolution  pseudospectral 
method.  The  projection  is  polar  stereographic,  and  the  winter  northern  hemisphere 
is  shown.  An  axisymmetric  initial  state  is  disturbed  by  smoothly  distorting  the  lower 
boundary  in  a  large-scale  pattern  so  as  to  imitate  the  effect,  on  the  real  stratosphere,  of 
planetary-scale  Rossby  waves  propagating  up  from  the  much  denser  troposphere  below 
(Charney  and  Drazin  1961).  The  result  looks  remarkably  similar  to  what  is  seen  in  the 
real  winter  stratosphere  at  altitudes  of  the  order  of  25  t,o  50  km. 

The  left  panel  of  Figure  9  shows  Q ,  now  the  shallow-water  potential  vorticity,  defined 
as  absolute  vorticity  over  local  layer  depth.  The  model  problem  still  has  the  generic  form 
(6.1)  to  excellent  approximation,  but  with  a  shorter-range  potential-vorticity  inversion 
operator*  I(-).  The  central  region  is  the  model’s  ‘stratospheric  polar  vortex’,  where  the 
relevant  material  contours,  which  in  this  experiment  lie  initially  along  latitude  circles, 
are  almost  coincident  with  the  Q  contours  and  undulate  nearly  reversibly.  In  this  region 
there  is  little  Rossby-wave  breaking.  This  is  illustrated  by  comparison  with  the  right 
panel,  which  shows  the  behaviour  of  some  material  contours  computed  very  accurately 
using  a  high-precision  ‘contour  advection’  technique  adapted  from  the  work  of  Dritschel 
(1988).  Outside  the  polar  vortex,  in  middle  latitudes,  is  a  region  in  which  the  waves 
are  breaking  vigorously.  There,  the  initially-latitudinal  material  contours  are  deformed 
rapidly  and  irreversibly,  and  mixed  into  a  broad,  2D-turbulent  ‘Rossby-wave  surf  zone'; 
e-folding  times  for  contour  lengthening  were  estimated  to  be  about  4  days  (Norton,  op. 
cit.).  This  ‘surf  zone’  is  the  real-stratospheric  counterpart  of  the  idealized  Rossby-wave 
critical  layer.  It  is  far  broader,  and  very  different  in  detail,  but  it  illustrates  equally 
well  the  robustness  and  one-signedness  of  the  angular  momentum  transport  associated 
with  irreversible  ^-rearrangement.  As  already  suggested,  such  Rossby-wave  breaking 
is  an  important  part  of  how  the  mean  circulation  illustrated  in  Figure  2  is  driven. 
Furthermore,  its  recognition  in  models  of  the  global-scale  stratospheric  transport  of 
trace  chemicals  like  CFCs  is  beginning  to  lead  to  improved  realism  in  the  predictions 
of  such  models,  both  via  a  more  realistic  mean  circulation,  and  also  via  a  more  realistic 
representation  of  the  quasi-horizontal  turbulent  transport  (Garcia  et  a 1.  1992). 

This  latter  aspect  may  be  especially  important  for  ozone  photochemistry  (and  rele¬ 
vant  to  some  current  controversies  about  ozone  depletion  — -  see  §10  of  my  1992  review). 
This  is  because  ozone  photochemistry  by  its  nature  could  be  more  sensitive  than,  for 
instance,  CFC  photochemistry,  to  the  timing  of  a  typical  molecule’s  excursions  across 
the  ‘surf  zone’.  Timescales  for  such  excursions  are  comparable,  at  certain  altitudes,  to 
photochemical  timescales.  Together  with  the  mean  circulation  itself,  these  complicated 
fluid  motions  control  the  rate  at  which  ozone  is  produced  photochemically,  mainly  in 
the  high  tropical  stratosphere,  and  carried  thence  to  the  extratropical  kwer  strato¬ 
sphere  where  it  accumulates  (unless  destroyed  by  ‘ozone-hole  chemistry’)  in  far  greater 
concentrations  than  can  be  produced  by  tropical  photochemistry  alone. 

Model  simulations  like  that  of  Figure  9  are  also  relevant  to  understanding  the  Antarc¬ 
tic  ozone  hole.  The  contours  in  the  right-hand  panel  of  Figure  9  can  be  regarded  as  iso- 
pleths  of  an  advected  passive  tracer,  the  advection  being  very  accurately  simulated,  with 
no  artificial  diffusion.  The  simulation  shows  that,  at  least  in  the  model,  chemical  sub¬ 
stances  in  the  surf  zone  do  not  penetrate  past  the  region  of  strong  Rossby  elasticity  con¬ 
centrated  in  the  steep  Q  gradient  near  the  vortex  edge.  This  is  believed  to  be  important 


*To  rough  approximation,  this  I(  )  is  given  by  (6.2)  with  V~~2  replaced  by  (V2  -  «2)-1 .  corresponding 
to  an  elastic  membrane  tethered  by  local  springs,  somewhat  like  a  spring  mattress,  with  a  latitude- 
dependent  e-folding  scale  k-1  (the  ‘Rossby  radius’)  of  about  1400  km  at  the  pole,  2000  km  at  45°N, 
and  3000  km  in  the  tropics.  McIntyre  and  Norton  (1990,  1993)  give  examples  of  much  more  accurate 
(but  much  more  elaborate)  potential-vorticity  inversion  operators  !(•)  for  shallow- water  models. 
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Figure  9.  Winter  hemisphere  in  a  high-resolution,  shallow-water  numerical  model  of  the  stratosphere, 
from  Norton  (1993).  The  mean  depth  is  4  km,  and  the  numerical  resolution  (triangular  truncation 
at  total  wavenumber  127)  corresponds  to  a  mesh  size  roughly  1°  latitude.  Left  panel:  potential  vor- 
ticity  Q ,  contour  interval  4  x  10-9m-1s-1,  zero  contour  dotted.  (Negative  Q  values  would,  in  a 
three-dimensional  model,  signal  the  three-dimensional  mode  of  Rossby-wave  breaking  pointed  out  by 
O’Sullivan  and  Hitchman  1992.)  Right  panel:  isopleths  of  an  adverted  tracer  field,  initially  axisym- 
metric  and  coincident  with  the  Q  contours,  showing  the  fluid-dynamical  irreversibility  characteristic  of 
Rossby-wave  breaking  and  two-dimensional  ‘turbulence’.  This  was  computed  with  near-perfect  accu¬ 
racy  using  a  high-precision  ‘contour  advection’  techique,  introduced  independently  by  Norton  (1993) 
and  by  Waugh  and  Plumb  (1993)  using  an  algorithm  developed  in  another  context  by  Dritschel  ( 1988). 

for  ozone-hole  chemistry.  The  same  phenomenon  has  been  demonstrated  in  the  labora¬ 
tory  by  Sommeria  et  a 1.  (1989,  1991).  The  contour- advert  ion  technique,  conceived  of  as 
a  benchmark  numerical  tracer  advection  algorithm,  was  introduced  independently  by 
Norton  (op.  tit.)  and  by  Waugh  and  Plumb  (1993). 

Recently,  O'Sullivan  and  Hitchman  ( 1992)  have  shown  that  an  entirely  different,  mode 
of  Rossby-wave  breaking  is  possible  near  the  equator,  where  the  potential  vorticity 
itself  changes  sign.  It  conforms  to  the  general  wave-breaking  definition,  with  three- 
dimensional  rather  than  layerwise-two-dimensional  material  contour  deformations,  aris¬ 
ing  from  an  asymmetric  inertial  (quasi- centrifugal)  instability.  Among  other  things,  this 
may  have  new  implications  for  the  quantitative  modelling  of  the  QBO. 

9.  WIND-GENERATED  WATER  WAVES:  TWO  NEW  MECHANISMS? 

What  does  this  old  but  elusive  problem  look  like  from  the  foregoing  perspective?  The 
first  point  is  that  Rossby-wave  dynamics  is  involved,  albeit  on  much  faster  timescales 
than  before.  The  velocity  profile  in  the  air.  whether  or  not  approximately  logarithmic, 
will  usually  have  a  strong  curvature  u o,,y  near  the  water  surface  (y  vertical).  Therefore 
there  is  a  vertical  gradient  of  spanwise  vorticity  and  hence,  in  the  present  language,  a 
Rossby  elasticity,  in  the  airflow  just,  above  the  water  waves;  recall  (G.5)  and  Figure  G. 
That  is  why  the  Miles  inviscid  wind  wave  instability,  for  instance,  can  be  regarded  as 
a.  coupled  ’Rossby-wave,  gravity-wave’  instability.  One  can  think  in  terms  of  a  pair  of 
phase- locked,  counterpropagating  waves,  a  backward-propagating  Rossby  wave  in  the 
air  coupled  to  a  forward- propagating  gravity  wave  in  the  water.  (Other  examples  of 
Rossby-wave,  gravity-wave  instabilities  go  back  to  G.  I.  Taylor's  work  in  the  1930s  on 
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the  ‘Taylor -Goldstein  equation';  see  also,  e.g.,  Griffiths  ct  a 1.  (1982),  Hayashi  and  Young 
(1987),  and  Sakai  (1989).)  In  the  Miles  instability,  the  Rossby  elasticity  is  relatively 
weak.  This  implies  not  only  that  the  water  wave  largely  determines  the  phase  speed  c. 
but  also  that  the  growth  rate  is  slow  and  that  the  most  significant  Rossby  effects  occur 
near  the  critical  line  Uo(y)  =  c,  where  intrinsic  phase  speeds  are  slow.  The  mathematical 
underpinning  for  these  statements  is  well  known  and  is  given  in  detail  by  Miles  ( 1957) 
and  Lighthill  (1962). 

Now  the  self-consistency  of  the  linearized  instability  theory  requires  that  conditions 
near  the  critical  line  resemble  those  in  Figure  7a.  In  the  present  language,  the  Rossby 
wave  is  not  only  weak  and  slow,  but  is  also  in  the  earliest  stages  of  wave  breaking, 
in  the  generalized  sense  already  referred  to.  The  later  stages  can  be  expected  to  look 
more  like  those  of  Figure  7b-7d  and  beyond,  with  a  small  value  of  the  effective  mixing 
scale  bm ,  viewed  in  a  suitably  undulating  coordinate  system.  This  suggests  that  the 
idealized  scenario  represented  by  the  inviscid  Miles  mechanism  lacks  robustness,  when 
taken  literally.  The  delicate  phase-locking  and  synchronization  of  the  normal  mode’s 
displacement  fields  will  be  disrupted  by  Rossby  wave  breaking  before  much  vertical 
rearrangement  of  spanwise  vorticity  can  take  place  (and  long  before  the  water  waves 
break).  The  outcome  will  be  net  vorticity  rearrangement  ,  in  the  airflow,  over  only  a  small 
effective  mixing  depth  6m,  of  the  same  order  as  the  scale  b  given  by  (7.1)  except  that 
the  lengthscale  uQy/(i  is  replaced  by  a  typical  value  of  -  u0y/u0yy  near  the  critical  line. 
There  is  a  correspondingly  small  net  wave-coherent  momentum  transport  6M  oc  bm  1 
from  air  to  water,  having  the  order  of  magnitude  implied  by  (7.3)  with  3  again  replaced 
by  a  typical  value  of  —Uoyy  near  the  critical  line. 

However,  there  is  one  important  feature  of  this  idealized  scenario  that  does,  on  the 
other  hand,  look  robust.  This  is  the  sign  of  the  net  wave-coherent  momentum  transport, 
which  tends  to  be  such  that  the  water  waves  are  amplified  when  travelling  in  the  same 
direction  as  the  wind.  More  precisely,  the  sign  is  determined  by  the  sign  of  —n0yy  and 
the  one- signedness  of  the  associated  Rossby-wave  dynamics  —  the  same  one-signedness 
that  so  strongly  controls  the  sense  of  the  mean  circulation  throughout  most  of  Figure  2. 
In  the  wind-wave  problem  the  sign  is  determined  quite  independently  of  whether  or  not 
we  have  strictly  i-periodic  waves  and  delicately  synchronized  exponential,  normal-mode 
growth. 

This  sign-robustness  must  imply  a  kind  of  ‘ratchet  effect’.  Almost  any  spatio- 
temporal  intermittency  —  whether  it  be  any  tendency  of  the  water  waves  to  arrange 
themselves  in  groups  through,  for  instance,  subharmonic  instabilities,  or  any  gustiness 
of  the  wind  that  might  be  modelled  as  an  intermittency  in  quantities  like  —uoyy  -- 
will  tend  to  favour  intermittent  wave-coherent  momentum  transport  whose  effects  are 
cumulative.  They  may  possibly  also  be  such  as  to  reinforce  the  intermittency.  There  is 
no  longer  any  reason,  moreover,  why  the  effective  depths  bm  associated  with  any  Rossby 
wave  breaking  should  be  especially  small. 

More  generally,  the  sign-robustness  points  to  the  likely  effectiveness  of  any  process  in 
which  wave-coherent,  undulation  of  the  airflow  over  the  water  waves  causes  Rossby  wave 
activity  in  the  air  to  increase  in  total  amount,  or  to  dissipate,  or  both.  The  relevant 
measure  of  ‘activity’  for  this  purpose  is  a  suitably  defined  quasimomentum  (Killworth 
and  McIntyre  1985;  Shepherd  1990,  &  refs.),  since  the  photon  analogy  can  be  shown 
to  apply  here,  in  the  required  sense.  Any  such  process  will  result  in  wave-coherent 
momentum  transport  in  such  a  sense  as  to  amplify  the  water  waves.  This  could  well  be 
important  for  the  real  wind-  wave  problem,  despite  the  added  complexities  of  turbulence 
and  other  three-dimensional  effects.  There  appear,  in  particular,  to  be  two  distinguish¬ 
able  mechanisms  whose  possible  role  deserves  closer  attention,  both  theoretically  and 
experimentally. 


One  is  simply  local  Rossbv  wave  breaking  characterized  by  lelatively  large  effective 
depths  bm.  If.  for  instance,  a  gust  produces  an  effective  —  uQlJg  that  then  encounters 
a  wave  group,  leading  to  rapid  vertical  rearrangement  of  spamvise  vorticity  (reducing 
— u o v w ) ’  then  the  water  waves  will  be  correspondingly  amplified  for  a  short  time.  The 
transient  nature  of  such  phenomena  would  add  to  the  well-known  difficulties  of  observing 
the  associated  pressure  phase  shift  experimentally.  The  other  mechanism,  which  in 
reality  might  tend  to  occur  at  the  same  time,  is  the  formation  of  Rossby  lee  waves  in 
the  airflow.  This  will  give  rise  to  "wave  resistance'  in  the  usual  way.  not  depending  on 
wave  breaking  of  any  kind.  Indeed  the  lee-wave  mechanism  should  be  at  its  most  efficient 
when  both  Rossby  wave  breaking  and  water  wave  breaking  are  unimportant.  Then  one 
should  get  unseparated  flow  over  the  water  waves  (e.g.  Banner  1990.  refs.)  and. 
for  at  least  some  —uoyy  profiles,  an  efficient  shedding  of  Rossby -wave  quasimomentum 
into  the  downstream  airflow.  It  seems  possible  that  the  amplifying  wave  group  in  the 
experiments  of  Chu  et  a l.  reported  by  Professor  Phillips,  which  they  call  the  ‘short  wave 
energy  front',  might  depend  on  the  Rossby  lee- wave  mechanism  and  the  experiments 
might  offer  a  chance  to  study  it  under  controlled  conditions. 

These  ideas  might  also  help  resolve  some  longstanding  questions  about  the  effects 
of  three-dimensional  turbulence  on  different  timescales,  from  the  relatively  long,  ‘dis¬ 
sipative'  timescales  required  to  restore  —  u0yy  profiles,  to  the  relatively  short,  'elastic' 
timescales  on  which  rapid-distortion  theory  might  be  appropriate,  merely  modifying 
the  Rossby  elasticity  in  the  air  —  as  with  internal  gravity  waves  in  the  important 
observational  study  by  Finnigan  and  Einaudi  (1981),  in  which  eddy- viscosity  or  other 
steady-state  dissipative  turbulent  modelling  was  shown  to  be  wrong  by  ‘an  order  of  mag¬ 
nitude’.  In  some  wind-wave  models,  turbulent  processes  are  modelled  semi-empirically 
for  instance  by  adding  something  like  an  eddy  viscosity  to  the  Miles  theory.  Such  de¬ 
vices  (drastically  modifying  the  type  of  behaviour  suggested  by  Figure  7)  would  seem 
unavoidable  in  models  that  insist  on  r-periodicit.y.  or  on  horizontal  statistical  station- 
arity  and  linear  superposability  of  wave-coherent,  processes.  But  a  physically  correct 
turbulence  model  might  need  to  bring  in  a  wider  range  of  timescales.  This  suggests 
that  it  could  be  fruitful  to  try  to  model  the  spatio-temporal  intermit tency  of  the  water 
waves  and  the  airflow  explicitly. 
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Abstract 

The  determination  of  the  mechanical  response  of  a  body  subjected  to  some  prescribed 
history  of  loading  is  the  classical  problem  in  the  analysis  of  elastic-plastic  bodies.  Closed 
form  solutions  are  not  possible,  except  in  a  very  limited  number  of  problems,  and  numer¬ 
ical  methods  must  be  adopted.  Spatial  discretisation  is  most  commonly  carried  out  by 
the  finite  element  method,  and  the  process  is  well  understood.  Time  discretisation,  and 
the  associated  algorithms  to  advance  from  one  time  step  to  the  next,  however,  have  been 
the  subject  of  considerable  discussion  in  recent  years. 

Work  contributing  towards  this  problem  has  been  carried  out  in  the  Centre  for  Research 
in  Computational  and  Applied  Mechanics  at  the  University  of  Cape  Town,  resulting  in 
a  coherent  overall  framework.  This  paper  is  intended  to  summarise  and  consolidate  the 
major  features  of  this  framework. 

The  process  begins  with  the  time  discretisation  of  the  continuous  problem.  The  backward 
difference  algorithm  is  adopted  because  of  its  relevance  in  plasticity:  this  step  is  equivalent 
to  replacing  the  elastic-plastic  material  by  a  nonlinear  elastic  material  for  the  time  step. 
The  minimisation  of  the  potential  energy  of  this  nonlinear  elastic  problem  leads  to  a 
nonlinear  programming  problem.  If  the  elastic-plastic  material  is  stable,  the  nonlinear 
programming  problem  will  be  convex. 

The  minimisation  of  the  programming  problem  can  be  carried  out  by  a  two  step  iterative 
process  which  can  be  identified  with  the  conventional  Newton-Raphson  algorithm.  In  the 
first  (predictor)  step,  the  nonlinear  functional  is  approximated  by  a  quadratic  functional 
whose  minimisation  is  a  linear  problem.  The  choice  of  quadratic  functional  affects  the 
convergence  of  the  problem;  several  choices  are  possible.  The  second  (corrector)  step 
involves  the  calculation  of  the  properties  of  the  nonlinear  functional  for  the  state  predicted 
by  means  of  the  approximation;  this  requires  a  local  calculation  and  can  be  identified  with 
the  return  algorithm. 

The  spatial  discretisation  is  now  introduced  into  the  predictor  step  -  this  is  essentially  the 
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discretisation  of  a  linear  problem.  The  final  finite  element  formulation  is  thus  achieved. 
Several  other  questions  are  amenable  to  analysis  in  this  approach,  and  generalisation  to 
trapezoidal  and  midpoint  rules  is  possible.  These  issues  will  be  discussed  briefly. 


1  Introduction 


Our  purpose  in  this  paper  is  to  summarise  and  review  work  carried  out  at  the  Centre 
for  Research  in  Computational  and  Applied  Mechanics  at  the  University  of  Cape  Town 
in  recent  years  (1  -  21]  on  the  topic  of  the  time  integration  of  the  classical  problem 
in  plasticity.  What  is  sought  is  insight  into  the  mechanics  of  the  integration  process,  as 
opposed  to  a  purely  mathematical  approach  to  the  integration  of  the  differential  equations. 

The  work  referred  to  in  [1-21]  is  framed  largely  in  terms  of  an  internal  variable  theory  of 
plasticity:  this  theory  has  provided  several  insights  which  are  not  immediately  clear  in 
the  conventional  formulation.  In  this  paper,  however,  we  will  present  the  concepts  in  a 
conventional  formulation. 

We  shall  consider  an  elastic-plastic  body  subject  to  some  quasi-static  loading  history. 
We  shall  assume  that  the  displacements  are  small,  and  that  the  material  is  stable.  The 
basic  incremental  equations  will  be  formulated,  and  time  discretisation  will  be  achieved 
by  using  the  backward  difference  scheme.  On  this  basis  we  will  show  that  the  incremental 
elastic-plastic  problem  reduces  to  a  nonlinear  elastic  or  holonomic  problem,  with  which  a 
convex  nonlinear  programming  problem  can  be  associated. 

We  will  then  discuss  a  two  step  iterative  algorithm  for  the  minimisation  of  the  program¬ 
ming  problem;  this  can  be  identified  with  the  conventional  Newton-Raphson  method. 
Issues  of  convergence  can  also  be  clearly  understood-  Finally,  the  backward  difference 
approximation  can  be  relax -d,  giving  an  equivalent  trapezoidal  and  midpoint  rule. 


2  '  Formulation  of  the  incremental  problem 

We  consider  a  body  of  volume  V  and  surface  5.  The  coordinates  of  a  point  in  the  body 
are  denoted  by  x,,  i  =  1,2,3,  and  time  is  denoted  by  t.  The  strain  -  displacement 
relations  are 


..  =  _  (!£}*  ,  9ui\ 

>J  2  \dx}  dxl  J  ’ 


(1) 


where  u,(xj,<)  is  the  displacement  field  and  e,j(x,,  t)  is  the  strain  tensor. 
The  equilibrium  equations  are 


dxj 


+  Fi  ~  0  on  V, 


(2) 
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where  Fx(x,,t)  are  body  forces  and  a,j(xl,l)  is  the  stress  tensor.  In  addition 

=  Pi  on  5,  (3) 

where  Pi(x<,  t)  are  surface  tractions,  and  ut  is  the  out  ward  normal  to  the  surface. 

The  loading  history  comprises  specified  body  forces  F,(x,,t)  on  V,  surface  tractions 
Pi(xi,t)  on  part  of  the  surface  Sp,  and  prescribed  displacements  u,(x,tt)  on  the  remainder 
of  the  surface  Su.  For  simplicity  we  shall  assume  that  u,(xt,  t)  —  U  on  Su. 

The  governing  equations  are  completed  by  the  constituitive  relations.  We  assume  that 
the  strain  e,-,-  can  be  divided  into  elastic  and  plastic  components,  tf,,  tf,: 


The  elastic  strain  is  given  by 


£ i ,  ki  &kli 


where  AjH  is  the  compliance  tensor.  We  define  a  yield  function 

<t>  =  <f>(crij,  Ha ) 


where  Ha  are  internal  variables  which  control  hardening.  Then 


j  =\J?t 

u  day' 


A  =  0  if  4>  <  0  (8) 

dd> 

or  4>  =  0  and  — —  oy,  <  0.  (9) 

dcr.j 

These  are  the  cases  of  elastic  behaviour  and  unloading  from  a  plastic  state.  For  the  Cc.se 
of  plastic  loading, 


dd> 

A  >  0  for  4>  —  0  and  — — a,;  >  0. 

dal} 


We  note  that  during  plastic  deformation 


Depending  on  the  specific  form  of  Ha,  and  ensuring  that  equation  (11)  is  satisfied,  we 
may  show  that 


\  —  G  >  0.  (12) 

In  addition,  we  must  specify  evolution  equations  for  the  internal  variables  Ha.  We  note 
that  Ha  —  0  for  <f>  <  0  or  <p  =  0  and  (d(}>/dax])ax3  >  0. 

The  constitutive  equations  are  invertible,  and  may  be  written  in  the  form 

aij  =  B,jki  e'ki ,  (13) 

where  Bxjk\  —  Ha).  For  stability,  we  require  that  cr',;  eXJ  >  0,  and  hence  that  Bxjkl 

is  positive  semi-definite.  Equation  (13)  can  be  integrated  with  respect  to  time  along  any 
given  strain  path  in  order  to  determine  the  stress  response. 


3  Discretisation  in  time 

We  now  introduce  a  time  discretisation.  We  suppose  that  we  know  the  solution  at  time 
in- i  and  we  denote  the  fields  by  F,n_1 , p"-1 ,  u"-1 ,  e?-1 ,  cr*~l.  At  a  subsequent  instant  tn 
we  know  F" ,  p?  on  5P,  u"  on  5„,  and  we  must  find  u"  on  V  and  Spx  e oT-y 

Let 


Au?  =  un  -  un-\  Aen  =  en  -  en_1 

*  1  V  ij  tj 

The  kinematic  equations  (1)  are  linear,  and  hence  we  can  write  for  time  tn: 


(14) 


S— 

<,  ",  =  Pi 


on 

on 


V, 

S, 


M ) 


1 

2 


/  dAu l  dAu?  \ 

\  dxj  +  dxx  J  ' 


(15) 

(16) 
(17) 


To  these  equations  we  must  add  a  set  of  discrete  constitutive  equations  relating  Ae";  and 
&ij  in  order  to  complete  the  governing  equations  for  time  tn. 
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4  Discrete  constitutive  relations 

As  a  result  of  the  time  discretisation,  we  must  introduce  an  additional  assumption  in  order 
to  be  able  to  integrate  the  constitutive  equations  (13)  over  the  interval  t„_i,  tn.  Physically, 
this  can  be  interpreted  as  choosing  a  path  in  strain  space  along  which  the  strain  changes 
from  the  previous  value  e”-1  to  the  new  value  t”,  as  shown  diagrammatically  in  Figure  1. 


Figure  1:  Assumed  path  in  strain  space 


There  are  a  variety  of  possible  choices,  but  one  which  has  particularly  desirable  properties 
is  the  minimum  work  path  [22,12].  The  minimum  work  path  is  defined  as  a  path  in  strain 
space  for  which  the  integral 


Wn  = 


(18) 


takes  its  least  value  IV"  for  given  r’)~'  and  arbitrary  e".  The  integral  VFn  is  evaluated 
by  integrating  the  constitutive  equation  (13)  along  the  path.  The  minimum  work  path  is 
not  unique,  in  the  sense  that  there  may  exist  a  family  of  paths  for  which  the  work  is  the 
least  value  W°. 


The  function  W°  has  several  important  properties.  If  the  elastic-plastic  material  is  stable, 
1)}  =  W°{  Ac”,  //„(<„-!)}  will  be  a  convex  function  of  Ac". 
Further,  it  is  a  potential  function  which  provides  the  stress  associated  with  c"  at  the  end 
of  the  path.  Thus 
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dW° 


dAetJ 


(10) 


For  conventional  plastic  materials,  the  minimum  work  path  [23]  is  characterised  by  a 
straight  line  path  in  internal  variable  space.  Alternatively,  it  is  a  path  such  that  the 
plastic  strain  direction  is  fixed.  This  implies  that,  for  a  stress  change  from  orf~l  to  <7", 
the  path  in  stress  space  ,  as  shown  in  Figure  2,  is  divided  into  two  parts:  first  an  elastic 
segment,  with  the  stress  path  remaining  inside  the  yield  surface,  and  second,  a  segment 
in  which  the  stress  remains  constant  at  yield  (for  an  elastic,  perfectly  plastic  material)  or 
moves  out  along  a  normal  to  the  yield  surface  for  a  hardening  material.  Integration  of 
the  constitutive  equations  along  this  path  is  generally  fairly  straightforward,  and  in  most 
cases  equation  (18)  can  be  evaluated  explicitly. 


Figure  2:  Minimum  work  path  for  an  elastic,  perfectly  plastic  material 

In  the  case  of  an  elastic,  perfectly  plastic  material  the  path  in  stress  space  is  particularly 
straightforward  (see  Figure  2):  a  stress  change  occurs  first,  accompanied  by  elastic  strain 
changes,  and  the  the  stress  remains  constant  while  plastic  strain  changes  take  place.  It  is 
evident  from  Figure  2,  for  an  elastic,  perfectly  plastic  material,  that  if  the  yield  surface 
is  convex,  we  may  write 

K;  -<r3TK>0.  (20) 

This  may  be  recognised  as  an  incremental  form  of  the  principle,  of  maximum  plastic  work ; 
it  is  the  existence  of  this  inequality,  and  its  appropriate  generalisation  for  hardening 
materials,  which  leads  to  the  convexity  of  W°. 
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The  minimum  work  path  assumption  in  fact  corresponds  to  a  backward  difference  assump¬ 
tion  for  plastic  strain  rates:  if  we  were  to  put 

efJ  =  {(l-a)^ri|+ai!"},  (21) 

and  to  integrate  the  constitutive  equation  (13)  over  the  interval  <n_, ,  tn ,  we  would  recover 
the  minimum  work  path  with  a  =  1. 


5  The  incremental  problem 


The  incremental  problem  is  now  defined  in  terms  of  the  field  equations  (15-17)  and  the 
constitutive  equation  (19).  We  may  recognise  that  the  constitutive  equation  is  essentially 
a  stress-strain  relation  for  a  nonlinear  elastic  or  holonomic  material,  with  W°  playing 
the  role  of  a  strain  energy  function.  Thus,  as  a  result  of  the  minimum  w'ork  path  or 
backward  difference  assumption,  the  incremental  problem  in  plasticity  is  transformed 
into  an  incremental  holonomic  or  nonlinear  elastic  problem. 

The  problem  may  be  equivalently  stated  in  the  form  of  a  programming  problem,  in  which 
we  are  required  to  minimise  the  potential  energy  of  the  holonomic  problem.  The  solution 
to  the  incremental  problem  is  given  by  the  fields  An",  Ac”  which  give  the  least  value  of 
the  functional 


=  f  W^effdV  -  /  F” AuidV  —  /  pfAuidS. 

J  V  JV  v  Sn 


(22) 


The  functional  U°  is  convex  if  W°  is  convex.  When  the  incremental  solution  is  obtained, 
equation  (22)  is  updated,  with  (n  +  1)  replacing  n. 


6  Solution  algorithm 

We  now  consider  an  appropriate  algorithm  for  finding  the  least  value  of  the  functional 
U°.  The  simplest  and  most  direct  algorithm  is  a  two  step  algorithm.  In  the  first  step, 
which  we  shall  refer  to  as  the  predictor  step,  we  approximate  the  nonlinear  functional 
U°  by  a  quadratic  functional:  the  minimisation  of  this  quadratic  approximation  is  then  a 
linear  problem,  and  leads  to  improved  estimates  of  the  kinematic  variables.  In  the  second 
step,  which  will  be  referred  to  as  the  corrector  step,  we  recompute  those  properties  of  the 
functional  U°  which  are  needed  to  proceed  with  the  next  iteration. 

We  shall  consider  each  of  these  steps  in  turn  for  the  (i  +  1  )th  iteration:  we  have  estimates 
Au"’,  Ac”1,  and  we  seek  improved  estimates  A u"b+1),  Ae^’+1^.  We  put 

Aiii  =  Aii;‘(i+1)  -  Au”‘,  Ae,;  =  Ae"(i+1)  -  Ac”‘. 


(23) 
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The  functional  U°  is  then  written  as 


U°n  =  /  W°{ Ac"'  +  ActJ)dK  -  /  ^(Au?  +  Au,-)^  -  /  P”(A<*  +  Afi,)dS.  (24) 
Jv  J  V  J  Sp 

6.1  The  predictor  step 

In  the  predictor  step  we  simplify  the  nonlinear  problem  by  introducing  a  quadratic  ap¬ 
proximation  to  the  function  W°.  Some  flexibility  is  permissible  in  the  choice  of  the 
approximation:  we  write 


^n(A«*i'  +  Ae,j)  =  W°(Ae"')  + 


aw; 


dAti, 


Acn  1 
u 


Ac ij  +  ~  C-ji-iAiijAtki 


1 


-  ^(Ae”’)  +  oVAia  +  -  C‘jk,AeijAe 


ki- 


(25) 


Substituting  W°  in  equation(25)  for  W°  in  equation  (24)  we  may  define  a  new  functional 
U°'  given  by 


K  =  KW 


Acnt 


)  +  tc 


(26) 


where 

u°  =  Jv~  C;jklAitJAiu  dV-^jv  F?A-Ui  dV  +  Js  p?  Aui  dS  -  ^  a"‘  Ae0-  dV^  .  (27) 

The  least  value  of  U°  gives  values  Afq,  Ae,;  and  hence  improved  estimates  Aujl*'+1>, 
Aejtv+1)  from  equation  (19).  What  is  significant  is  that  U°  is  a  quadratic  function,  and 
hence  the  problem  of  finding  its  least  value  is  a  linear  problem.  A  variety  of  choices  of  C’jhl 
can  be  made:  these  will  be  discussed  in  detail  when  we  refer  to  the  rate  of  convergence 
of  the  algorithm. 


6.2  The  corrector  step 

We  note  that  in  the  expression  for  0°,  given  in  equation  (27),  we  need  to  know  F™,  p", 
which  are  part  of  the  data,  and  <7*’,  which  is  not  given.  Thus  before  advancing  to  the 
next  iteration  we  must  compute  this  is  the  corrector  step. 

The  stress  depends  on  the  strain  at  the  end  of  the  n-th  time  step ,  and  the  strain 

at  the  end  of  the  (i  -I-  l)-th  iteration  for  the  (n  +  1 ) - 1 h  time  step,  or  the  total  accumulated 
increment  Ac”  6+1).  The  accumulated  increment  is  assumed  to  take  place  along  the  path 
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designated  in  Section  4,  and  the  constitutive  equations  are  integrated  along  this  path.  We 
note  thus  that  the  computation  of  is  a  local  calculation  at  each  point  in  the  body. 

If  the  backward  difference  assumption  is  adopted,  integration  of  the  constitutive  equations 
reduces  to  the  problem  of  determining  AefJn(,+I\  since  we  can  put 


n(t+l)  _  (n- 

aij  ~ 


1)  +  CijW(Ae^i+1>-A4r(’+1) 


), 


where  Cijki  =  Ajl;  are  ^ie  elastic  constants. 

It  is  convenient  to  define  an  elastically  predicted  stress  <7"eI,+  1)j  defined  by 


(28) 


nE(  t+I) 
U 


= »ir,) 


(29) 


Then,  if  H™  are  internal  variables  associated  with  the  i  —  th  iteration,  we  note  that 


<7#i+1)  =  <r^E(i+1),  Ae£l(<+1)  =  0,  if  ^(a1n/(,+I), //”')<  0.  (30) 

If  4>(cr"?^+1\  //"’)  >  0,  then  Ae-”l(l+1*  0.  The  computation  of  is  then  generally 

represented  graphically,  and  is  referred  to  as  the  return  algorithm.  The  simplest  case,  that 
of  an  elastic,  perfectly  plastic  material,  is  shown  in  Figure  3.  The  minimum  work  path, 
as  described  earlier,  is  made  up  of  an  elastic  change  from  to  er”(l+1\  followed  by 

plastic  strain  changes  at  constant  stress  if  ,+1*  lies  on  the  yield  surface.  In  Figure  3, 
for  the  case  where  Aef"*!+t)  ^  0,  we  show  the  vectors  —  <xj”  ,))  and  C,jki Aefn,t+I), 

which  make  up  cr”Ebfl>  It  can  clearly  be  seen  that  if  (7"£(,+1)  js  given,  we  can  drop  a 
perpendicular  back  onto  the  yield  surface  in  order  to  find  Similar,  slightly  more 

elaborate  rules  of  the  same  type  can  be  constructed  for  hardening  materials,  for  both 
continuously  differentiable  and  piecewise  continuously  differentiable  yield  surfaces. 


Figure  3:  Return  algorithm  for  an  elastic  perfectly  plastic  material 


3 !  4 


We  emphasise,  however,  that  the  return  algorithm  is  not  a  heuristic  choice,  but  can  be 
set  up  rigorously  for  any  elastic-plastic  material  in  terms  of  the  minimum  work  path. 
Alternatively,  in  the  internal  variable  formulation  the  return  algorithm  corresponds  to 
a  local  minimum  principle  [20].  Piecewise  linear  yield  surfaces  are  included  in  these 
formulations. 


7  Convergence 

The  steps  of  the  iterative  algorithm  are  showm  diagrammatically  in  Figure  4.  The  sketch 
shows  the  function  U°  we  set  out  to  minimise,  together  with  the  i  —  th  estimate  of  the 
solution.  The  approximating  quadratic  function  U°'  is  also  shown,  leading  to  the  (i+l)—th 
estimate.  The  corrector  step  permits  us  to  re-establish  the  new  estimate  on  the  original 
functional  U°. 


Figure  4:  Steps  in  the  iterative  algorithm 


Thus  at  the  end  of  the  two  steps  of  the  iteration  we  can  compute  f/°(Au’,*,+1*,  Ae^,+l*). 
A  comparison  of  £/°(Au"',  Ae-*’)  and  t/£(Au"(t+1*,  Ae,"*,+1))  provides  us  with  information 
on  the  convergence  of  the  algorithm:  it  is  clear  that  a  sufficient  condition  for  monotonic 
convergence  is  that 


f/n°(A<<‘+I>,  Ae"(i+,))  <  t/n*(A<\  Ac” '). 


Further,  since  by  the  definition  of  the  process, 


(32) 
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it  follows  that  the  sufficient  condition  for  monotonic  convergence  is  met  if 


f/°(A<(’+1),  Ae"(,+1))  <  f/"(Att"(,+,\  Ae"J(,+1*) 


(33) 


In  geometrical  terms,  this  implies  that  sufficient  conditions  for  monotonic  convergence 
can  be  met  if  the  least  value  of  the  functional  U°‘  lies  within  the  functional  U°.  Whether 
or  not  this  occurs  depends  on  the  choice  of  the  predictor  modulus  C'l]kl. 

It  can  be  shown  that  this  condition  is  met  if  C’jkl  is  chosen  as  the  elastic  constants  CtJki 
[13,16].  However,  the  disadvantage  of  this  choice  is  that  the  rate  of  convergence  is  very 
slow.  Improvements  can  be  made  by  choosing  the  secant  modulus  [13]  or  a  softened  elatic 
modulus  7  where  0.5  <  7  <  1  [24].  The  improvements,  however,  are  not  substantial, 
and  the  rate  of  convergence  remains  slow. 

The  rate  of  convergence  can  be  increased  substantially  if  the  consistent  tangent  predictor 
[25]  is  chosen:  here  we  put 


d2W° 


dAcijd&tkt 


A<n  1 


(34) 


We  note  that  if  Ac]"11  =  0, 


d2W° 


dAe.jdAew 


—  Cijkl  1 


At"  ’ 

V 


(35) 


i.e.  the  consistent  tangent  predictor  simply  the  elastic  modulus.  It  is  also  important 
to  note  that  in  the  first  iteration  in  a  load  increment  Ac]"1  —  0  at  all  points  in  the  body, 
and  hence  the  elastic  moduli  are  chosen  as  the  elastic  predictor  for  this  iteration.  This 
ensures  convergence  in  the  first  step  even  when  the  load  increment  represents  unloading. 

In  general,  however,  the  consistent  tangent  predictor  does  not  meet  the  sufficient  condition 
for  monotonic  convergence  of  equation  (33).  This  follows  because  the  least  value  of  the 
functional  U°l  does  not  necessarily  lie  within  the  functional  (/°,  and  therefore  constraint 
(33)  is  not  met  in  all  circumstances. 


8  Line  search  algorithm 

Convergence  of  I  h“  algorithm  with  the  consistent  tangent  predictor  can  be  assured  if  a 
line  search  algorithm  [25,26]  is  adopted.  In  our  present  context,  this  can  be  demonstrated 
by  simple  geometric  arguments. 

In  the  line  search  algorithm  we  compute  Au,,  Acq  as  before,  but  we  consider  improved 
estimates 
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A u"(!+1>  -  Au“‘  +  aAiii,  A(”!,  +  l!  =  At"'  +  aAtl}.  (36) 

where  a  >  0  is  a  scalar  multiplier. 

Consider  first  the  approximating  quadratic  function  U°\  given  in  equations  (26)  and  (27). 
We  may  put 


(/°'(Au,n‘  -f  aAu„  Ae"-‘  +  aAijj)  — 

t£(Au?\  Ac"')  +a2Jvl-  CijH AiijAiu  dV 

-a  |  Jv  F?Aut  dV  +  Js  p*Au,  dS  -  ajAe^  cfV  J  .  (37) 

As  expected,  this  function  has  its  least  value  for  a  =  1;  note  also  that  U£'(A u"1  + 
aAu,,  Ae^'  -f  aAijj)  —  (/“( Au"‘,  Ae^‘)  for  a  =  0  and  a  =  2.  The  function  is  shown 
diagrammatically  in  Figure  5. 


t 

» 


Figure  5:  Convergence  of  the  line  search  algorithm 

Now  consider  the  nonlinear  functional  t/^(Au"’  +  aAu,,  Ac"'  +  rAt.j).  First,  we  note 
that 

fT(a  -  0)  -  V°n'{a  =  0).  (38) 


Further,  we  see  that 


31? 


dUZ 

da 

The  function  U°(a)  is  also  plotted  diagrammatically  in  Figure  5.  The  geometry  of  this 
function  permits  us  to  reach  two  conclusions.  First,  the  least  value  of  will  occur  for 
a  >  0,  and  will  be  less  than  L'°{A u”‘,  Ac”*),  following  from  equations  (38)  and  (39). 
Second,  U°(a  =  1)  may  be  less  than  or  greater  than  U°(a  =  0).  Thus,  if  the  line  search 
algorithm  is  not  routinely  used,  the  iteration  may  or  may  not  provide  an  improved  estimate 
in  the  sense  of  an  improved  value  of  V°.  However,  if  U°(a  =  1)  >  !  ’"( «  =  0).  it  is  clear 
that  there  exists  a  value  a',  0  <  a1  <  1,  for  which  U°{a  =  a’)  <  U°(a  =  0). 

In  general,  the  application  of  the  line  search  algorithm  involves  finding,  at  least  approxi¬ 
mately,  the  value  of  a  which  provides  the  least  value  of  F“(c).  T  his  may  occur  for  a  <  1 .  or 
for  «  >  1:  in  either  case  monotonic  convergence  is  assured  and  the  optimum  improvement 
Fir  the  iteration  is  found 


dm 


da 


a=0 


(39) 


9  Spatial  discretisation 

With  the  framework  described  above  in  place,  we  may  now  consider  the  implementation 
of  the  finite  element  discretisation.  This  is  applied  to  the  weak  form  of  equations  (26) 
and  (27),  i.e.  the  minimisation  of  U°'  or  U°.  This  is  a  quadratic  form,  and  standard  finite 
element  discretisation  procedures  may  be  applied.  The  discrete  form  of  equation  (27)  will 
be 

=  l-AuT K' Mjl  -  Aut R\  (40) 

where  Aw  is  the  vector  of  improvements  in  the  nodal  displacements,  K'  is  the  consistent 
tangent  matrix  and  Rl  is  the  residual.  The  tangent  matrix  K'  will  be  symmetric  if  C‘jU 
is  symmetric,  as  is  the  case  with  the  choices  of  C'l)kt  which  have  been  discussed.  It  may  be 
noted  that  the  magnitude  of  the  residual  depends  on  the  difference  between  the  external 
nodal  forces  associated  with  F”,  p"  and  the  internal  nodal  forces  associated  with  <7™. 
When  the  external  and  internal  nodal  forces  are  in  equilibrium,  FV  =  0  and  equation  (40) 
gives  Aii  =  0,  indicating  that  the  iteration  is  complete. 


10  Generalised  trapezoidal  rule 

The  development  of  the  solution  algorithm  has  been  presented  in  terms  of  the  adoption 
of  the  minimum  work  path  as  the  strain  path  for  the  increment.  This  is  equivalent  to  a 
backward  difference  assumption  in  the  approximation  of  the  plastic  strain  over  t  he  interval 
(see  equation  21 ). 

Experience  has  shown  that  the  backward  difference  assumption  is  robust  and  effective 
[27].  It  also  has  the  advantage  of  providing  a  symmetric  consistent  tangent,  stiffness  mat  rix 
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and  unconditional  B  -  stability.  Nevertheless,  the  backward  difference  approximation  is 
associated  with  linear  rather  than  quadratic  convergence.  First  attempts  to  formulate  a 
trapezoidal  rule  led  to  non-symmetric  operators. 

Ortiz  and  Popov  [28]  proposed  a  general  class  of  integration  algorithms,  including  both 
a  generalised  mid-  point  rule  and  a  generalised  trapezoidal  rule.  These  rules  do  nut 
provide  symmetric  operators,  except  for  the  special  case  of  the  backward  difference  rule 
in  each  case;  this  is  a  major  disadvantage.  Sirtio  and  Govindjee  [29]  have  shown  that 
a  midpoint  rule  can  be  formulated  which  provides  both  quadratic  convergence  and  a 
symmetric  operator.  It  can  be  shown  simply  that  this  is  equivalent  to  a  generalised 
trapezoidal  rule  with  a  symmetric  operator. 

Full  details  of  the  development  have  been  given  by  Rencontre,  Oaddemi  and  Martin  [19] 
and  Caddemi  [21].  The  assumed  strain  path  over  the  interval  remains  tin*  same:  the 
initial  stress  is  <r\}  ,  and  the  final  stress  is  cr"  .  The  strain  path  is  made  up  of  t  wo  parts, 
and  elastic  part  and  a  part  where  plastic  strain  takes  place  with  a  fixed  direction  in  strain 
space.  In  computing  the  strain  increment,  however,  we  include  a  part  of  the  plastic  strain 
at  the  end  of  the  previous  increment  and  a  part  of  the  plastic  strain  at  end  of  the  current, 
increment.  Thus,  we  put 

-  or];-1))  +  (1  -  a) Arf -')  +  aAc?;\  (-11 ) 

where  0  <  a  <  1 . 

When  a  =  1,  we  recover  the  backward  difference  formulation.  The  rule  is  shown  diagram- 
matically  in  Figure  6  for  a  perfectly  plastic  material.  Note  that  (1  -  r>)A(fJ<n~1)  is  known 

from  the  previous  increment;  if  cr*]1-11  lies  inside  the  yield  surface  A<;(n_1>  will  of  course 
be  zero.  This  is  the  essential  difference  from  the  trapezoidal  rule  of  Ortiz  and  Popov  [28], 
where  the  direction  of  the  plastic  strain  associated  with  was  assumed  known,  but 

not  its  magnitude. 

We  note  also  that  Arf;  will  be  zero  if  cr"  lies  within  the  yield  surface.  In  the  case  of 
unloading,  where  Ae;ln-I)  is  not  zero,  we  see  that  Ae,n  may  thus  involve  plastic  strain 
projected  from  the  previous  increment.  This  suggests  that  the  generalised  trapezoidal 
rule  may  not  be  optimally  accurate  for  unloading  steps,  where  the  backward  difference 
assumption  is  more  effective. 

The  operator  for  the  generalised  trapezoidal  rule  so  defined  is  symmetric,  is  uncondition¬ 
ally  B  -  stable  for  a  >  j,  and  provides  second  order  accuracy  for  o  it  is  indeed 

directly  equivalent  to  the  generalised  midpoint  rule  of  Simo  and  Govindjee. 
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Figure  6:  Generalised  trapezoidal  rule  for  an  elastic  perfectly  plastic  material 


11  Conclusions 

A  coherent  overall  framework  for  the  iterative  solution  of  the  incremental  elastic-plastic 
problem  which  is  based  on  the  physical  concepts  of  extremal  paths  and  piecewise  holo- 
nomic  materials  has  been  presented. 

The  underlying  physical  concepts  allow  several  insights  which  are  not  accessible  in  a  purely 
mathematical  treatment,  and  work  is  continuing  to  attempt  to  exploit  these  insights. 
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Abstract 

Nuclear  fusion  technology  is  currently  making  remarkable  progress,  and  the  structural  design  of 
the  experimental  reactors  is  becoming  one  of  the  most  crucial  problems  in  this  technology.  The 
present  paper  is  concerned  with  the  constitutive  modeling  of  inelastic  behavior  and  damage 
evolution  of  materials  under  thermal  and  neutron-irradiation  conditions  applicable  to  the  rational 
structural  analysis  of  nuclear  reactor  components.  After  discussing  the  mechanisms  and  the 
physical  theories  of  neutron  irradiation  reported  so  far,  a  constitutive  model  of  creep,  swelling  and 
damage  under  irradiation  was  developed.  It  was  assumed  that  the  creep  under  irradiation  can  be 
decomposed  into  irradiation-induced  creep  and  irradiation-affected  thermal  creep.  The  applicability 
of  the  proposed  constitutive  equations  was  demonstrated  by  analyzing  the  creep,  swelling  and  creep 
damage  of  type  316  stainless  steel  under  various  conditions  of  irradiation.  As  an  application  of  the 
proposed  equations,  creep  crack  growth  under  neutron  irradiation  was  also  analyzed. 


1.  INTRODUCTION 

Polycrystalline  metals  exposed  to  the  irradiation  of  fast  neutron  at  elevated  temperatures  not 
only  show  salient  intrinsic  phenomena  of  irradiation  growth,  swelling,  irradiation  creep, 
irradiation  damage,  etc.,  but  also  are  subjected  to  significant  influence  on  their  mechanical 
behaviors  of  deformation  and  fracture  under  stresses  [1-4].  Fuel  cladding  tubes  of  FBR(Fast 
Breeder  Reactors)  [5-7],  for  example,  are  irradiated  by  fast  neutron  flux  of  the  energy 
E>0.1MeV  at  the  temperatures  of  400-700°C,  and  their  mechanical  behaviors  have  been 
investigated  extensively. 

Another  and  more  important  objective  of  irradiation  effects  is  the  plasma  facing  components 
of  fusion  reactors.  Hitherto,  the  central  problem  of  the  nuclear  fusion  has  been  concentrated  to 
the  realization  of  the  fusion  reaction.  However,  a  plasma-burn  experiment  due  to  D- 
T(Dcutcrium-Tritium)  fusion  reaction  was  demonstrated  in  the  JET(Joint  European  Torus)  in 
November  1991  for  the  first  time  in  the  history,  and  the  success  of  this  experiment  has 
confirmed  the  feasibility  of  the  controllable  nuclear  fusion  reaction.  This  success  implies  that 
the  problems  of  nuclear  fusion  arc  coming  to  the  engineering  stage,  and  the  intensive 
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international  collaborations  for  the  conceptual  and  engineering  design  of  the  experimental  fusion 
reactors  have  been  already  effected. 

The  Tokamak  type  fusion  reactor  of  ITER(Intemational  Thermonuclear  Experimental 
Reactor)  project,  in  particular,  is  supposed  to  have  the  burn  time  of  1000  sec  and  bum  cycles  of 
105,  and  the  neutron  energy  and  the  neutron  wall  loading  are  14.1MeV  and  lMW/m2, 
respectively.  If  Type  316  stainless  steel  is  employed  as  the  first  wall  material,  it  will  be  subject 
to  the  maximum  temperature  of 400°C  and  the  displacement  damage  of  31.4  dpa  during  the 
whole  life;  the  material  will  undergo  swelling,  irradiation  creep,  irradiation  damage  under 
considerable  cyclic  thermal  stresses. 

The  recent  development  of  the  nuclear  fusion  technology  is  making  the  structural  design  of 
the  experimental  fusion  reactors  one  of  the  most  essential  problems  in  that  technology.  Though 
a  number  of  investigation  have  been  performed  on  the  creep  and  damage  under  neutron 
irradiation,  they  are  mainly  concerned  either  with  the  kinetics  of  the  interstitial  atoms,  vacancies 
and  dislocations,  or  with  crystal  physics  and  metallurgy,  and  thus  continuum  mechanics 
theories  on  the  constitutive  behavior  of  material  under  irradiation  have  been  still  undeveloped. 

The  present  paper  is  concerned  with  the  mechanical  modeling  of  creep,  swelling  and 
damage  of  poly  crystalline  metals  under  thermal  and  neutron  irradiation  conditions  applicable  to 
the  structural  analyses  of  nuclear  reactor  components  under  multiaxial  states  of  stress.  A 
constitutive  equation  of  creep,  swelling  and  damage  under  irradiation  will  be  formulated  in  the 
framework  of  continuum  mechanics,  and  by  taking  account  of  physical  mechanisms  of  neutron 
irradiation.  Continuum  damage  mechanics  will  be  employed  to  describe  the  irradiation  damage 
and  final  fracture  of  materials.  It  will  be  assumed  that  the  creep  under  irradiation  is 
decomposed  into  irradiation-induced  creep  and  irradiation-affected  thermal  creep.  The 
proposed  constitutive  equation  was  applied  to  analyze  the  creep  and  damage  of  type  316 
stainless  steel  under  various  conditions  of  irradiation.  Finally,  the  utility  of  the  proposed 
equations  was  demonstrated  by  analyzing  the  creep  crack  growth  by  means  of  local  approach  of 
fracture  based  on  finite  element  method. 

Since  the  irradiation-induced  creep  is  weakly  dependent  on  temperature  [2,  8],  the  effect  of 
temperature  variation  will  be  disregarded  in  this  paper.  Though  the  bubble  formation  at  grain 
boundaries,  driven  by  helium  generation,  may  affect  the  mechanical  behavior  of  austenitic 
stainless  steels  at  high  temperatures  [3,  8],  it  will  not  be  taken  into  account  because  of  the  lack 
of  experimental  data. 


2.  MECHANISMS  AND  PHYSICAL  THEORIES  OF  IRRADIATION-RELATED 
CREEP 

Preceding  to  the  continuum  mechanics  modeling  of  creep  and  damage  subject  to  the 
irradiation  effects,  let  us  first  review  the  current  understanding  of  the  physical  mechanisms  and 
mechanical  aspects  of  the  relevant  phenomena. 


2.1.  Mechanisms  of  irradiation  related  creep 

When  a  fast  neutron  collides  with  an  atom  in  a  crystal,  the  atom  is  knocked  on  from  the 
lattice  cite,  and  gives  a  pair  of  an  interstitial  atom  and  a  vacancy  {Frenkel  pair).  If  the  primary 
knock-on  atom  has  sufficiently  high  energy,  the  atom  will  hit  another  atom,  initiates  a  sequence 
of  collision  and  knocking-on  of  the  constituent  atoms,  and  thus  induces  a  collision  cascade  in 
the  crystal  [1,  3].  Then,  a  part  of  the  resulting  interstitial  atoms  and  vacancies  may  annihilate 
by  recombining  with  each  other  or  by  clustering,  or  may  be  absorbed  by  sinks  such  as  voids, 
dislocations  and  grain  boundaries  [1-4]. 

Interstitial  atoms  have  larger  mobility  than  vacancies.  A  part  of  interstitial  atoms  coalesce  to 
form  dislocation  loops,  or  are  absorbed  by  existing  dislocations  to  facilitate  their  climb.  The 
annihilation  or  absorption  of  interstitial  atoms  increases  the  number  of  lattice  sites,  and  brings 
about  dilatation  (swelling).  The  excess  vacancies,  on  the  other  hand,  aggregate  to  form  voids, 
and  contribute  to  void  swelling.  Thus  when  polycrystalline  metals  are  subjected  to  irradiation  at 
elevated  temperatures,  in  addition  to  the  ordinary  thermal  creep,  the  coalescence  and  the 
absorption  of  interstitial  atoms  together  with  the  aggregation  of  vacancies  give  rise  to 
irradiation-induced  creep  and  swelling. 

Though  the  motion  of  dislocations  under  stress  may  be  accelerated  by  their  facilitated  climb 
due  to  the  absorption  of  interstitials,  they  may  be  decelerated  at  the  same  time  by  the  formation 
of  dislocation  loops,  dislocation  networks,  vacancies,  clusters  and  precipitates  brought  about 
by  the  irradiation.  The  ordinary  thermal  creep  caused  by  stress-controlled  dislocation  motion, 
therefore,  is  significantly  affected  by  the  neutron  irradiation.  Thus,  the  creep  under  irradiation 
may  be  divided  into  irradiation-induced  creep  and  irradiation-affected  thermal  creep.  The 
dilatational  part  of  the  irradiation-induced  creep  is  identified  as  swelling.  The  term  irradiation 
creep  will  represent  the  creep  which  is  obtained  by  subtracting  the  stress-free  swelling  from  the 
irradiation-induced  creep. 

2.2.  Physical  models  of  irradiation-induced  creep 

Though  a  number  of  physical  models  have  been  proposed  so  far  to  describe  the  creep  under 
irradiation,  they  have  been  focused  mainly  on  irradiation-induced  creep,  or  irradiation  creep  and 
swelling  [1-3].  One  of  the  most  successful  models  to  describe  the  irradiation-induced  creep  is 
the  Climb-Enhanced  <7//rfe(CEG)  mechanism,  originally  due  to  Ansell  and  Weertman  [9]. 
Absorption  of  interstitial  atoms  brought  about  by  ‘u.?  W-H^tion  facilitates  the  c!:mb  and  glide  of 
dislocations.  By  postulating  the  operation  of  the  Frank-Read  dislocation  source  and  assuming 
that  the  creep  rate  is  proportional  to  the  climb  velocity,  we  have  the  creep  rate  as  follows  [2,3]: 

e  =  k(o2 /Gl)S  (1) 

where  (')  is  the  time  derivative,  and  k ,  cr  and  S  stands  for  a  material  constant,  modulus  of 
rigidity  and  the  swelling  rate.  This  mechanism  postulates  the  flow  of  excess  interstitials  into 
dislocations,  and  the  creep  rate  is  proportional  to  the  rate  of  point  defect  nucleation,  and  hence 
to  the  swelling  rate  S .  By  assuming  that  the  climbed  dislocation  segment  can  bow  and  glide 


326 


according  to  the  Mott-Friedel  equation  for  dislocation-bowing,  Gittus  (10,  1 1 J,  furthermore, 
developed  the  following  equations: 


e  =  k(cr  !G)S 

(2) 

4  =  WDom 

(3) 

where  eijt  oDij  and  ^are  the  strain  rate  tensor,  deviatoric  stress  tensor  and  the  dislocation 
flux,  respectively. 

Heald  and  Speight  [12],  on  the  other  hand,  by  taking  account  of  the  elastic  interaction  of  a 
point  defect  with  the  strain  field  around  an  edge  dislocation,  proposed  Stress-Induced 
Preferential  Absorption  (SIPA)  mechanism.  Namely,  the  magnitude  of  the  long  range  force 
between  an  edge  dislocation  and  an  interstitial  atom  depends  on  the  relative  orientation  between 
the  directions  of  applied  stress  and  the  Burgers  vector,  and  the  dislocations  of  different  Burgers 
vectors  have  different  rates  of  interstitial  flux.  Therefore,  dislocations  of  the  preferential 
onentation  may  easily  absorb  interstitials,  easily  climb  and  give  rise  to  the  macroscopic  creep 
rate  in  the  corresponding  direction.  Since  the  anisotropic  rate  of  dislocation  climb  is 
proportional  to  the  rate  of  nucleation  of  point  defects  JQ, the  creep  rate  is  finally  given  as  follow 
(1,2,  12]: 

£  -  kK^o  (4) 

The  SIPA  mechanism  has  been  ascertained  to  explain  a  number  of  important  features  of  creep 
under  irradiation,  especially  when  the  swelling  is  not  significant.  Since  the  CEG  and  SIPA 
mechanisms  are  not  mutually  exclusive  but  may  operate  concurrently,  the  total  creep  rate  can  be 
given  by  the  combination  of  these  two  mechanisms  [13]: 

£  -  kpS  +  k2Kp  (5) 

Finally  the  nucleation  of  irradiation-induced  interstitial  loops  also  depends  on  the  direction 
of  the  applied  stress;  they  nucleate  preferentially  on  crystallographic  planes  nearly  perpendicular 
to  the  applied  stress.  Brailsford  and  Buliough  [14],  for  example,  assuming  this  SIPN  ( Stress- 
Induced  Loop  Nucleation )  mechanism,  derived  the  following  equation: 

£  -  kfi  pDoS 

where  b  and  Po  are  the  Burgers  vector  and  the  dislocation  density  of  the  interstitial  loops.  The 
SIPN  mechanism  is  valid  for  low  dislocation  density,  but  has  a  serious  restriction.  Namely, 
according  to  this  model,  once  a  prefered  loop  orientation  is  nucleated,  the  strain  rate  will  be 
controlled  by  the  irradiation  dose  rate,  but  is  not  affected  further  by  the  applied  stress. 

The  swelling  rates  S  in  eqs.  (1),  (2),  (5)  and  (6)  have  been  given  by  the  rate  theory  based 
on  the  diffusion  rates  and  density  of  interstitials  and  vacancies  [1,  3,  13],  or  by  the  empirical 
equations  expressed  in  terms  of  stress,  flux  of  irradiation  and  temperature  [2,  7,  15]. 


3.  MECHANICAL  MODELING  OF  CREEP,  SWELLING  AND  DAMAGE 

UNDER  IRRADIATION 

In  nuclear  reactor  components  operating  at  elevated  temperatures,  besides  the  irradiation- 
induced  creep  discussed  above,  the  stress-controlled  thermal  creep  under  irradiation  is  also 
predominant.  Thus  for  the  engineering  design  of  the  reactor  components,  we  need  not  only  to 
reformulate  the  above  irradiation-induced  creep  in  terms  of  mechanics,  but  also  to  develop  a 
rational  constitutive  equation  which  unifies  the  irradiation  creep,  swelling,  thermal  creep  and 
damage  and  can  be  easily  incorporated  into  the  computer  algorithms. 

For  this  purpose,  we  will  note  that  the  creep  under  irradiation  e'  can  be  divided  into  the 
irradiation-induced  creep  £ *  and  the  irradiation-affected  thermal  creep  ,  the  former  is 
caused  exclusively  by  the  irradiation  and  may  occur  even  under  vanishing  stress,  while  the 
latter  is  the  ordinary  thermal  creep  but  accelerated  or  decelerated  by  irradiation. 

3.1.  Mechanical  modeling  of  irradiation-induced  creep  [16-18] 

For  the  mechanical  modeling  of  irradiation-induced  creep  £tj  ,  let  us  first  postulate  two 
important  mechanisms  of  CEG  and  SIPA.  According  to  these  mechanisms,  as  described  in  the 
preceding  sections,  the  rate  of  irradiation-induced  creep  etf  will  be  specified  by  the  current 
state  of  stress  cr  and  the  current  rates  of  nucleation  of  interstitial  atoms  and  vacancies,  with 
respect  to  the  current  state  of  internal  structure  of  the  material  developed  by  the  irradiation.  The 
rates  of  nucleation  of  interstitial  atoms  and  vacancies  will  be  proportional  to  the  neutron  flux  <p, 
and  the  state  of  the  internal  structure  of  the  material  will  be  identified  by  the  densities  and 
configurations  of  interstitial  atoms,  vacancies  and  dislocations  brought  about  by  the  irradiation. 
Since  the  dislocation  density,  above  all,  is  identified  by  the  neutron  fluence  <I>  =  /  4>  dt  [2,  20], 
we  will  assume  that  the  internal  state  of  material  can  be  described  by  <t>.  The  rate  of  irradiation- 
induced  creep  etj  may  be  represented  as  follows: 

«if  “  Ptt{°kbb*)  (7) 

In  contrast  to  the  dislocation-controlled  thermal  creep,  the  distribution  of  dislocations  and 
the  internal  structure  of  the  material  caused  in  the  irradiation-induced  creep  will  have  not 
significant  anisotropy.  Thus,  in  view  of  the  linear  stress  dependence  of  CEG  and  SIPA 
described  by  eqs.  (2)  and  (4),  the  general  form  of  eq.  (7)  may  be  expressed  as  an  isotropic 
tensor  function  of  stress  a ’y  of  the  order  one  and  zero. 

+£(&<* >yartkSif  +^,4>)cr/  (8) 

where  symbols  rj,  f,  and  £arc  material  functions  of  <p  and  <h,  and  <5, ;  denotes  the  Kronecker 
delta.  If  we  decompose  the  stress  and  strain  into  isotropic  and  deviatoric  parts,  eq.  (8)  leads  to 
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(9b) 

(9c) 


4C=377  +  (3£  +  0au.5 

£Dij  ”  ^aDij 

where  e1^  and  aDij  are  the  deviatoric  parts  of  e"c  and  aIJf  respectively. 

The  symbol  sik  of  eq.  (9)  stands  for  the  volumetric  strain  raie,  and  hence  corresponds 
exactly  to  the  swelling  rate  5.  Therefore,  in  view  of  the  incubation  period  of  swelling  [2,  7, 
21]  and  the  linear  dependence  of  creep  and  swelling  on  the  neutron  fluence  4>  [2,  21],  and 
employing  the  transient  expression  of  Bates  and  Korenk  [7],  we  finally  have  the  constitutive 
equation  of  irradiation-induced  creep  and  swelling  (16, 18]: 

^ofj  ”  (1  /  3)*5<5,7  +  (  3  /  2)  P <P°Dij  (10a) 

S  -  C[1  -  *«*-*>  /{l  +  e *(*-*>}]  0  +  Qo* )  <p  (10b) 

where  C,  P,  Q  and  R  are  material  functions  of  0  and  O,  and  x  stands  for  the  incubation  fluence 
for  swelling.  Where  symbol  (  )  ,  furthermore,  represents  the  Macauley  bracket. 

3.2.  Mechanical  modeling  of  irradiation-affected  thermal  creep  [16, 18] 

Irradiation-affected  thermal  creep  is  brought  about  by  the  ordinary  thermal  creep  affected  by 
the  irradiation  effects.  The  thermal  creep  of  polycrystalline  metals  at  elevated  temperatures,  on 
the  other  hand,  is  induced  by  the  diffusion  controlled  dislocation  motion  activated  by  stress, 
and  is  usually  accompanied  by  creep  damage;  the  creep  damage  occurs  as  a  result  of  the 
nucleation  and  growth  of  grain  boundary  cavities  brought  about  mainly  by  the  grain  boundary 
sliding  [22]. 

By  examining  the  results  of  creep  tests  under  irradiation  [2,  6,  23],  and  by  comparing  the 
results  with  those  of  the  ordinary  unirradiated  tests,  the  effects  of  the  concurrent  neutron 
irradiation  on  creep  may  be 

1)  acceleration  and  deceleration  of  dislocation  motion  as  a  result  of  the  nucleation  of  point 
defects,  void  clusters,  dislocation  networks  and  the  precipitation  of  inclusions. 

2)  deceleration  of  nucleation  and  growth  of  grain  boundary  voids  and  the  delay  of  the  onset 
of  the  tertiary  creep  [6],  probably  due  to  the  supply  of  interstitial  atoms  from  rich  source  of 
point  defects. 

These  features  imply  that  the  mechanisms  of  thermal  creep  and  creep  damage  per  se  are  not 
changed  by  the  irradiation,  and  the  rates  of  creep  and  damage  will  be  subject  to  quantitative 
change  depending  on  the  neutron  flux  <f>  and  the  neutron  fluence  d>. 

Thus,  if  we  postulate  the  Kachanov-Rabotnov  creep-damage  theory  [24]  together  with  the 
strain-hardening  hypothesis  of  McVctty  type  creep  law  [25],  and  assume  that  the  effects  of 
irradiation  on  the  constitutive  and  the  evolution  equations  can  be  described  through  the  change 
of  their  material  constants,  we  have  [16, 18] 

“(3/2 )/aVa''<-,%  +  (3/2).4(^,<I>)[o'E)3 /(I - D)]n(*'<t’)oDij  /(I -D) 

D- D)\  *(#'“>) 


(11a) 

(lib) 
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where  oW  and  oEQ  are  the  maximum  principal  stress  and  the  equivalent  stress.  The  symbols  A, 
B,  it  and  k  are  material  functions  of  0  and  4>,  while,-!*,  o*  and  n*  arc  material  constants  for  the 
transient  creep  law  [25].  The  symbol  D  in  eq.(ll),  furthermore,  is  an  internal  state  variable 
describing  damage  state  (damage  variable),  and  defined  as  follows: 

D  -  0,  at  t  =  0  (initial  undamaged  state) 

D  =  1,  at  t  =  tR  (final  ruptured  state)  (12) 

According  to  Gilbert  and  Chin's  experimental  data  for  20%  cold  worked  type  316  stainless 
steel  at  650°C  [5]  (see  Fig.  3  below),  no  significant  difference  is  observed  in  the  stress- 
dependence  of  rupture  times  among  unirradiated  creep,  creep  under  irradiation  and  post- 
irradiation  creep;  this  implies 

*(*,*) -*o  (13a) 

As  regards  the  creep  exponent,  the  results  observed  in  Reference  [2]  allows  us  to  assume 
h(0,<J>)  =  /jo  (13b) 

Since  the  material  function^  (<p,  <£)  is  related  to  the  creep  rate,  it  depends  on  the  behavior  of 
irradiation-induced  internal  defects.  The  effects  of  fluence  $  on  the  microstructure  of  the 
material  is  cumulative,  and  develops  dislocation  structures.  The  dislocation  density  and 
configuration  will  tend  to  an  asymptotic  state  as  <t>  increases  [2,  20].  The  effect  of  <p  ,  on  the 
other  hand,  is  brought  about  by  the  interaction  of  the  irradiation-induced  defects  with  the 
existing  defects  [26].  Namely  such  interaction  is  governed  by  the  rate  of  increase  of  point 
defects  induced  by  the  irradiation  together  with  the  rate  of  their  annihilation  due  to  their 
absorption  by  the  existing  sinks,  such  as  point  defect  clusters,  dislocation  networks,  etc.  Thus, 
sufficiently  large  value  of  0  again  leads  to  a  stationary  defect  structures  [26]. 

Furthermore,  the  function  5(0,  is  related  to  the  rate  of  creep  damage.  The  creep 
damage  in  polycrystalline  metals  is  brought  about  by  the  formation  and  the  development  of 
microcavities  on  the  grain  boundaries,  which  are  governed  mainly  by  the  local  diffusion  of 
atoms  and  the  matrix  creep  [22].  The  effect  of  on  the  material  is  embrittlement  due  to  the 
formation  of  point  defect  clusters  and  the  sessile  dislocation  networks.  The  effect  of  0,  on  the 
other  hand,  supplies  interstitial  atoms,  which  in  turn  decelerate  the  formation  of  grain  boundary 
cavities  and  retards  the  material  damage.  Increase  in  both  of  0  and  <!>,  therefore,  will  lead  to  an 
asymptotic  state  of  damage  rates.  In  view  of  these  microstructural  mechanisms,  we  will 
assume  the  following  functions  for.-!  and  B: 

A(4>,<*>)  -  4,11  +  «,0  -  ea>*)][l  +  «3(  1  -  ea'*)\  (13C) 

m*)  =  B0fl  +*>,(1  ~ebi*W  +&3O  ~eK*)\  (13d) 

where  a,,  a2,  a3,  bj,  b2,  b3  and  b4  are  constants  that  can  be  identified  by  comparing  the  creep 

curves  of  unirradiated  creep,  creep  under  irradiation  and  of  post-irradiation  creep  under  constant 

stress. 
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3 3.  Constitutive  equation  of  creep,  swelling  and  damage  under  irradiation 

Summarizing  the  results  of  the  preceding  discussions,  the  constitutive  equations  of 
irradiation  creep  and  swelling  arc  given  by  eqs.  (10),  (1 1)  and  (13)  as  follows  [16, 18]: 


■c  inc  ,  • uc 

£ij  m£ij  +£a 

=  (3/2 )A  o 

+  (3/2).40[l  +a,(l-e‘a^)][l  +a3(l  ~ £  a'^ )Ma sq D)fa'\cr /{\  -  D)\ 

(14a) 

D  -  B0[  1  +  ^(1  -  )][1  +  b3(  1  -  e-** *)][a(1)  /(l  -  D)f° 

(14b) 

S  «  C[l  -  /(l  +  **u‘#))](l  +  )  <t> 

(14c) 

By  taking  <p  -  O  -  0,  eq.  (14)  is  reduced  to  the  thermal  creep  under  unirradiated  condition, 
while  the  cases  of  <p  *  0,  O  •fipdt  and  <j>  -  0,  <J>  *  0  account  for  the  creep  under  irradiation  and 
the  post-irradiation  creep,  respectively.  Thuscq.  (14)  furnishes  a  unified  constitutive  equation 
for  unirradiated  creep,  creep  under  irradiation  and  for  post-irradiation  creep. 

3.4.  Analysis  of  creep  under  irradiation  of  20%  cold-worked  type  316  stainless  steel 

To  evaluate  the  validity  and  the  applicability  of  eq.  (14),  we  will  now  perform  the  following 
analyses  for  20%  cold  worked  type  316  stainless  steel  at  650°C. 

Material  constants  in  eq.  (14)  can  be  identified  by  the  procedure  discussed  in  the  preceding 
papers  [17,  18],  and  can  be  determined  by  use  of  the  experimental  results  of  References  [5,  6, 

20,  21,  23].  The  material  constants  of  eq.  (14)  thus  determined  are  as  follow: 

A0  -  3.20  x  10‘13MPa"n°  hr'1,  ^  -  3.50,  B0  -  6.00  x  10'10MPa'k“  -  hr1,  k0  - 
A *  - 1.30  xio-7 MPa'0',  a  -  8.50  x 10'3hr'\  n  -  2.00 

1.80 

(15a) 

X  “  5.00 x  lO^n’cm'2,  C  -  4.00  x  10_25(n  cm'2  •  hr)'1  •  hr"J, 

Q  -  4.75  x  10~3  MPa'1,  R  -  1.25  x  10_22(n  •  cm'2)’1 

(15b) 

a, -5.00  x  10  2 ,  a2 -2.60  x  10'19(n-cm'2 ’hr)1,  b,  = -0.95, 
b2  =  4.5  x  10"19(n  cm'2  ■  hr)'1,  P  -  2.50x  lO'^MPa  *  n  •  cm"2  •  hr)'1 

(15c) 

a3  -  -9.00  x  10'2,  a4  -  2.60  x  10'2!(n  •  cm-2)’1, 
b3  -  1.30,  b4  - 2.60  x  10'21(n  cm'2)'1 

(15d) 

where  the  material  constants  of  eqs.  (15a)  through  (15d)  were  identified  by  the  test  results  of 
unirradiated  creep,  swelling,  creep  under  irradiation  and  postirradiation  creep,  respectively. 

Fig.  1  shows  the  results  of  prediction  of  swelling  for  20%  cold  worked  type  316  stainless 
steel  at  400°C.  The  solid  lines  and  the  symbols  arc  the  predictions  of  eq.  (14)  and  the 
corresponding  experimental  results  of  Porter,  Takata  and  Wood  [21  j.  The  calculations  were 
performed  by  use  of  material  constants  related  to  stainless  steel  at  650°C,  and  the  material 
constants  of  eq.(15)  were  modified  by  use  of  temperature  dependence  of  swelling  identified  by 
the  experiments  [51. 


Swelling 
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Fig.  1  Swelling  of  20%  cold  worked  type 
316  stainless  steel  at  40(fc 
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Fig.2  Creep  curves  of  unirradiated,  post¬ 
irradiation  and  under-irradiation  creep  for 
20%  cold  worked  type  316  stainless  steel 


As  shown  in  Fig.l,  cq.  (14)  represents  the  continuous  transition  from  the  incubation  to  the 
steady-state  period  without  changing  the  characteristic  of  incubation  and  steady-state  periods. 
Fig.l  suggests  the  validity  of  eq.  (14). 

Figs.  2  and  3  show  the  comparison  between  the  experimental  results  [5]  for  unirradiated 
creep,  post-irradiation  creep  and  creep  under  irradiation,  and  the  corresponding  predictions  of 
cq.  (14).  The  symbols  in  the  figures  stand  for  the  results  of  experiment,  and  the  temperature, 
stress  and  irradiation  conditions  are  entered  in  the  figures. 

Fig.  2,  to  begin  with,  represents  the  creep  curves  for  different  conditions  of  irradiation.  It 
will  be  observed  that  the  constitutive  equation  (14)  describes  accurately  the  essential  features  of 
irradiation  creep;  i.e.,  creep  curves  under  irradiation  including  transient  creep  stage, 
embrittlement  due  to  prior  irradiation,  and  the  retardation  of  the  start  of  the  tertiary  creep  and 
increase  of  tensile  ductility  due  to  neutron  flux.  Fig.  3,  on  the  other  hand,  shows  the  stress- 
rupture  time  relations  for  unirradiated,  post-irradiation  and  under-irradiation  creep,  predicted  by 
cq.  (14).  It  will  be  observed  from  Fig.3  that  cq.  (14)  describes  the  dependence  of  rupture  times 
on  the  irradiation  conditions  observed  in  the  corresponding  experiments. 


4.  ELABORATION  OF  CONSTITUTIVE  EQUATIONS  FOR  CONSIDERABLE 
VARIATION  OF  STRESS  STATES 

4.1  Elaboration  based  on  creep-hardening  surface  model  of  variable  stress  creep  [19] 
Components  of  nuclear  structures  are  often  subject  to  large  change  of  stress  states,  not  only 
in  their  magnitude  but  also  in  their  direction.  The  first  wall  of  fusion  reactors,  for  example,  is 
subject  to  reversed  thermal  stress  due  to  the  intermittent  bum  of  plasma.  This  change  in  stress 
induces  significant  transient  increase  of  creep  rates,  and  has  a  large  influence  on  the  initiation 
and  the  growth  of  creep  cracks  in  the  materials.  The  irradiation-affected  thermal  creep  of 
eq.(ll)  is  based  on  the  strain-hardening  theory  of  creep,  and  cannot  be  applied  adequately  to 
the  problems  of  large  stress  variation  [25].  Thus  we  will  extend  the  creep-hardening  surface 
model  of  variable  stress  creep  [27]  to  include  the  effects  of  irradiation  and  material  damage. 

In  the  case  of  the  classical  strain-hardening  theory,  in  particular,  the  thermal  creep  rate  £,yC 
is  expressed  as  a  function  of  stress  and  the  history  of  creep  strain  ejf  as  follows  [27]: 

if 9 -(2/3 (16a, b) 

where  oEQ  and  aDlj  denote  the  equivalent  stress  and  the  deviatoric  stress,  respectively.  The 
symbol  q  in  eq.(16a),  furthermore,  is  a  creep-hardening  variable  (i.e.  an  internal  state  variable) 
representing  the  state  of  creep-hardening  of  the  material. 

The  conventional  theory  represented  by  eqs.(16a,b),  however,  cannot  describe  the  transient 
increase  of  creep  rate  which  is  observed  after  stress  reversals  or  change  in  stress  direction  [27], 
because  the  creep-hardening  variable  q  cannot  adequately  describe  the  recovery'  of  material 
hardening  after  the  stress  change.  In  order  to  overcome  this  difficulty,  Murakami  and  Ohno 
[27]  introduced  a  hyper-sphere  (creep-hardening  surface)  in  a  creep  strain  space  as  follows: 


g-(2/3)(elc-ail)(s!ic-aiJ)-p2^0 


(17a) 


where  a  .  and  p  denote  center  and  radius  of  the  hyper-sphere,  respectively.  The  evolution 
equations  of  these  variables  arc  given  as  follows: 


f(l-A)^ 

£  =  0  and  (Jg/Jtfyf  >0 

a,  - 

1" 

£<0  or  (dgldefyf*  0 

f(2/3)'"Af,fn4 

g  “  0  and  (%/^,[c)ff  >0 

p  « - 

1° 

£<0  or  (dg/def)i?-*  0 

”,  = 

(<7  -  a,  )/[(<7 

~akl)i£ld  “^tf)] 

(17b) 

(17c) 

(17d) 


where  A  is  a  material  constant  specifying  the  rate  of  development  of  p.  Thus,  the  creep- 
hardening  variable  q  representing  the  state  of  creep  hardening  under  a  given  deformation  history 
is  specified  by  use  of  atj  and  p  in  eq.(17)  as  follows: 

<?  =  [1/(2A )][/>+ (ff  -or ^oaj  I o eq )]  (18) 

If  we  employ  the  Bailey-Norton  creep  equation  of  constant  uniaxial  stress 

eTC-Aontm  (19a) 

and  postulate  the  strain-hardening  hypothesis,  we  have  the  following  equation  of  muitiaxial 
creep  [27]: 

kf  -  (3/2 )mAVna%Wm'1),m(oai /orEQ)*(3/2)f(aEQ;  q)(oaj /o^)  (19b) 

where  A,  n  and  m  denote  material  constants. 

In  view  of  the  concept  of  continuum  damage  mechanics,  the  damage  state  of  material  under 
creep  can  be  expressed  by  a  damage  variable  D.  If  we  employ  the  creep  damage  theory  of 
Kachanov-Rabotnov,  eq.  (19b)  can  be  extended  to  incorporate  the  effects  of  creep  damage  as 
follows: 

-  (3/2)rnAVmqlm-}),m[oEQl(l  -D)f  /(!-/>)],  D  -  fi[a(,,/(l-  £>))  *  (20a, b) 

where  om  denotes  the  largest  principal  stress,  and  B  and  k  are  material  constants.  The 
hardening  parameter  in  eq.  (20a)  can  be  given  by  eqs.  (17)  and  (18). 

Since  the  essential  mechanisms  of  irradiation-affected  thermal  creep  are  unchanged  from 
those  of  the  creep  and  damage  under  unirradiated  condition,  the  constitutive  equation  of 
irradiation-affected  thermal  creep  efj  and  the  related  evolution  equation  of  creep  damage  can 
be  provided  from  eqs.  (17), (18), (20)  by  replacing  the  coefficients/!,  m,  n  B,  k  and  A  with  the 
corresponding  material  function  of  <f>  and  $>. 

To  recapitulate  eqs.  (14),  (17)-(19)  and  the  above  argument,  the  constitutive  equations  of 
creep,  swelling  and  damage  under  irradiation  under  muitiaxial  and  variable  states  of  stress  are 
expressed  as  follows: 


(21a) 


=  (l/3)S£g  +{2>  l2)P<fxJDij 

+  (3 / 2)m0.-l (^, O )’ /,n 0 f/(m° ' l> ' [ <t£(2 /(I  -  D)p-m*>,m°{oDij/(\ -  D)\ 


9-[l/(2A0)][p  +  (c.7C  - 

(2  lb) 

[(1  ~  ^o)0/ 

g-0  and  (%/<c)Hc>0 

a, .  -  ■{ 

11  |0 

g<0  or  (Sgl s0 

(21c) 

.  J(2/3?*V?A 

g  *  0  and  (dg/fcf  Kf  >  0 

Ho 

g  <0  or  (Sgldsf)E™-  a  0 

(2  Id) 

au)(*u  ~au)t' 2 

(21c) 

D«#(0,<l>)[a(1)/(1-D)]*0 

(210 

5  =  C[1-^u'4>)/(1+p 

(21g) 

where  A(4>,^)  and  are  again  given  by  eqs.  (13c)  and  (13d). 


4.2.  Analysis  of  creep  under  irradiation  subject  to  variable  states  of  stress 

The  material  constants  of  eqs.  (21)  can  be  determined  by  the  procedures  similar  to  that  of 
Section  3.4  and  Reference  [27],  The  material  constants  were  identified  for  the  20%  cold 
worked  type  316  stainless  steel  at  650°C  and  were  given  in  Reference  [19]. 


Fig.  4  Creep  strain  fjt  under  uniaxial  stress  reversals  between  (T, ,  =  70  MPa  and 
-70MPa  with  an  interval  of  1000  hours 


Figure  4  shows  the  predictions  of  cq.(2l)  for  the  creep  subjected  to  uniaxial  stress  reversals 
under  three  different  conditions  of  irradiation.  The  stress  changes  between  ax  [=70MPa  and 
-70MPa  with  the  intervals  of  1000  hours  as  shown  in  the  figure.  It  will  be  observed  that  cq. 
(21)  describes  well  the  transient  increase  of  creep  rate  after  stress  reversals  under  each 
irradiation  condition.  Conventional  strain-hardening  theory  expressed  in  cq.  (16),  in  general, 
cannot  describe  this  increase  in  creep  rate,  and  accordingly  underestimates  the  relaxation  of 
stress.  It  means  that  conventional  strain-hardening  theory  predicts  smaller  redistribution  of 
stress  in  the  materials. 

Rupture  time  under  stress  reversal  conditions  in  Fig. 4,  however,  becomes  about  twice  as 
long  as  the  rupture  time  under  the  constant  stress  conditions  of  Fig.l.  It  is  because  the 
maximum  principal  stress,  which  governs  the  process  of  creep  damage  [see  evolution  equation 
of  creep  damage  (2lf)]  is  zero  when  stress  is  negative  <7n=-70MPa.  Noting  the  behavior  of 
creep  under  irradiation,  the  absolute  value  of  the  creep  rate  under  positive  stress  rr,  i=70MPa  is 
found  to  be  larger  than  that  under  negative  stress  an=-70MPa  for  neutron  fluencc  larger  than 
<t>  =  5  x  l022n  •  cm  2 ,  and  this  can  be  accounted  for  by  the  swelling  at  a  constant  rate  after  the 
incubation  period.  Thus,  creep  under  irradiation  is  found  to  increase  with  stress  cycles  in 
contrast  to  the  creep  under  other  irradiation  conditions. 
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Fig.  5  Creep  strain  under  change  of  stress  direction  between  0°  and  150°  at  the  interval  of  1000  hours. 

(a)  Axial  creep  strain  .  (b)  Shear  creep  strain  2e{2  /(3)'n . 

Finally,  Figs.  5  (a),  (b)  show  the  axial  and  shear  creep  strain  predicted  by  cq.  (21)  under 
different  irradiation  conditions  brought  about  by  the  multiaxia!  and  non -proportional  loading. 
The  alternative  stress  change  between  0°  and  150“  in  cru  ~(3)u2ai2  stress  plane  under  constant 
equivalent  stress  of  70MPa  at  intervals  of  1000  hours  arc  shown  in  the  figure.  Unfortunately, 
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there  are  no  experimental  data  corresponding  to  Figs. 4  and  5  under  post-irradiation  and 
irradiation  conditions  to  confirm  the  validity  of  cq.  (21). 


5.  ANALYSIS  OF  CREEP  CRACK  GROWTH  UNDER  NEUTRON  IRRADIATION 

As  an  application  of  the  constitutive  equations  developed  so  far,  we  will  elucidate  the  effects 
of  neutron  irradiation  on  creep  crack  growth  observed  in  nuclear  reactor  components.  For  this 
purpose,  the  constitutive  equation  (14)  was  incorporated  into  a  computer  code  of  finite  element 
analysis  (FEM),  and  the  creep  crack  growth  in  a  plate  of  20%  cold  worked  stainless  steel  at 
650°C  under  unirradiation,  post-irradiation  and  irradiation  conditions  was  analyzed  {28]  by  a 
local  approach  to  fracture  in  continuum  damage  mechanics.  The  material  was  assumed  to  show 
elastic-creep  behavior.  When  the  damage  variable  D  of  an  element  calculated  from  eq.  (14b)  has 
attained  to  a  critical  value,  the  element  is  assumed  to  be  ruptured.  The  aggregation  of  these 
ruptured  elements  is  identified  as  a  macroscopic  crack. 

The  mesh  division  by  simplex  elements  employed  in  the  analysis  is  shown  in  Fig.  6.  The 
plate  is  subject  to  the  constant  distributed  stress  o0  in  the  direction  perpendicular  to  the  initial 
crack,  and  was  assumed  to  be  in  a  state  of  plane  stress.  Number  of  elements  and  nodes  in  Fig. 
6  are  485  and  286,  respectively.  The  material  constants  of  eq.(15)  together  with  the  following 
value  of  the  modulus  of  rigidity  were  employed: 

G=54.8GPa  (22) 


► 


x 


1  =  75 


Fig.  6  Finite  element  discretization  o'  a  cracked  plate  subject  to  uniform  tension 
under  neutron  irradiation 
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Fig. 7  shows  the  relation  between  the  incipient  creep  crack  growth  rate  v  and  the  tensile 
stress  crQ  under  different  conditions  of  irradiation  entered  in  the  figure.  The  conditions  of 
irradiation  correspond  to  a  fuel  cladding  of  fast  breeder  reactors.  The  corresponding  results  of  a 
simplified  analysis  by  a  double  cantilever  beam  (DCB)  model  (29]  were  also  entered  in  the 
figure  by  dashed  lines.  As  observed  in  the  results  of  FEM  analysis,  the  crack  growth  rate  under 
neutron  irradiation  is  about  1/20  of  that  of  unirradiated  condition.  The  crack  growth  rate  of  the 
post-irradiated  material,  on  the  other  hand,  is  5  times  as  large  as  the  unirradiated  condition.  It 
should  be  noted  that  though  the  creep  crack  rate  is  suppressed  by  the  irradiation  and  the  crack 
growth  under  irradiation  looks  more  stable,  it  will  be  accelerated  by  about  100  times  once  the 
neutron  flux  vanishes.  The  creep  crack  growth  rates  predicted  by  DCB  model  is  2-4  times 
larger  than  those  of  FEM  analysis,  but  shows  similar  results. 


Distributed  stress  o0  MPa 

Fig.  7  Relation  between  tensile  stress  and  crack  growth  velocity  under 
different  conditions  of  irradiation 


6.  CONCLUSIONS  AND  REMARKS 

To  facilitate  engineering  design  of  nuclear  reactor  components  under  intensive  irradiation, 
especially  of  future  fusion  reactors,  constitutive  models  unifying  irradiation  creep,  swelling  and 
thermal  creep  should  be  developed  on  the  sound  bases  of  physics  and  continuum  mechanics.  In 
the  present  paper,  a  constitutive  equation  for  this  purpose  was  developed  by  dividing  the  creep 
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rate  into  irradiation-induced  creep  and  irradiation-affected  thermal  creep,  with  special  emphasis 
on  austenitic  stainless  steels.  The  resulting  constitutive  equation  was  applied  to  analyze  the 
creep  crack  growth  under  neutron  irradiation  to  elucidate  the  effects  of  concurrent  and  prior 
irradiation  on  creep  crack  growth  rates. 

Though  a  great  deal  of  results  have  been  accumulated  so  far  in  the  physical  and  metallurgical 
investigation  on  the  mechanical  behavior  of  materials  under  neutron  irradiation,  they  hardly 
have  a  relation  with  the  current  development  of  the  theories  of  plasticity,  viscoplasticity  and 
damage.  To  narrow  the  gap  between  the  present  knowledge  of  irradiation  creep  and  the  current 
state  of  design  technology  developed  so  far  for  fusion  reactors,  the  collaboration  of  continuum 
mechanics  and  metallurgical  physics  will  be  able  to  do  a  significant  contribution. 
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Abstract 

This  lecture  addresses  energy- related  aspects  of  anelasticity.  Anelasticity  also  means  irreversi¬ 
bility  and  dissipation.  To  study  the  behaviour  of  irreversible  media  (anelastic  materials  and 
structures),  an  interesting  and  important  direction  is  to  consider  their  description  in  the  general 
framework  of  thermodynamics.  Considerable  progress  has  been  achieved  and  various  approaches 
are  proposed  in  this  direction.  The  most  usual  one  is  based  on  a  systematic  description  of  irre¬ 
versible  systems  via  the  expressions  of  two  potentials  :  the  free  energy  and  the  dissipative 
potential.  This  approach  is  associated  with  the  work  of  several  authors  in  different  domains 
such  as  metal  plasticty,  friction,  fracture  and  damage  mechanics.  Our  objective  is  to  present  this 
description  in  relation  with  common  and  practical  models  of  anelasticity.  Some  general  results 
of  the  literature  concerning  the  quasi-static  response  of  irreversible  systems  are  presented.  In 
particular,  when  the  behaviour  is  time-independent  (e.g.  plasticity,  contact  with  dry  friction, 
brittle  fracture  or  brittle  damage),  general  results  concerning  stability  and  bifurcation  analysis 
can  be  expressed  in  an  unified  framework.  This  framework  will  be  presented  and  illustrated  by 
recent  examples. 


1.  INTRODUCTION 

Thermodynamics  provide  the  largest  framework  for  the  description  of  the  irreversible  behavi¬ 
our  of  anelastic  media  (materials  or  structures).  In  this  direction,  considerable  progress  has  been 
achieved  and  various  approaches  have  been  proposed.  For  example,  the  functional  approach 
found  in  Coleman  &  Gurtin's  work  has  been  successfully  applied  to  visco-elasticity  in  the  fif¬ 
ties.  In  the  last  two  decades,  significant  results  have  been  however  obtained  within  the  so-called 
internal  variable  theory.  In  particular,  the  introduction  of  internal  state  variables  can  be  com¬ 
plemented  by  a  systematic  description  of  the  material  behaviour  via  the  expressions  of  two 
potentials  :  the  free  energy  and  the  dissipation  potential  (cf.  [1-3]  for  example).  This  approach 
is  associated  with  the  works  of  several  authors,  in  particular  Biot,  Mandel  [4-5]  in  visco-elasti¬ 
city,  Moreau,  Mandel,  Rice,  Halphen  &  Nguyen  ..  [6-9]  in  plasticity  and  visco-plasticity,  Lem- 
aitre  &  Chaboche  [10]  in  damage  mechanics.  ..etc.  It  provides  a  systematic  and  general  frame¬ 
work  for  the  description  of  usual  laws  and  for  the  mechanical  modeling  of  new  phenomena.  It 
is  now  widely  accepted  and  becomes  more  and  more  popular  for  the  description  of  anelasticity. 

Our  objective  is  to  present  this  approach  and  to  underline  some  general  results  concerning  the 
quasi-static  evolution  of  irreversible  systems.  In  particular,  when  the  behaviour  is  time-inde¬ 
pendent  as  it  is  in  the  context  of  plasticity,  contact  with  dry  friction,  brittle  fracture  and  brittle 
damage,  general  results  concerning  stability  and  bifurcation  analysis  can  be  expressed  in  an  uni¬ 
fied  framework.  These  results  are.  illustrated  by  some  recent  examples  borrowed  from  the  litera¬ 
ture. 
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2.  THERMODYNAMIC  FRAMEWORK 

The  notions  of  state  variables,  state  equations,  free  energy,  complementary  laws  and  dissipa¬ 
tion  potential  D  are  here  assumed  for  all  dissipative  systems.  This  approach  leads  to  the  defini¬ 
tion  of  specific  models  of  materials  and  structures  denoted  as  standard  materials  and  standard 
systems. 

2.1.  Standard  materials 

A  material  can  be  described  by  states  variables  q  =  (e,a)  where  «  is  a  deformation  and  a  den¬ 
otes  internal  parameters  .  By  unit  reference  volume,  the  associated  free  energy  is  W(e,a).  The 
dissipation  is  : 

d  =  a.e  -  W  (1) 

where  a  denote  the  dual  stress  associated  to  e.  Dissipation  represents  a  product  of  fluxes  and 
forces  : 

d  =  (a  -  <rR  ).«  +  A.a  (2) 

where  oR  and  A  denote  the  associated  forces  : 

crR  =  W,t  ,  A  -  -  W,a  (3) 

To  simplify  the  presentation,  it  will  be  accepted  here  that  the  deformation  is  not  a  dissipative 
mechanism,  all  dissipative  mechanisms  are  taken  into  account  by  a.  It  follows  then  from  (2) 
that : 

a  =  <rR  (4) 

To  complete  the  constitutive  equations,  complementary  relations  between  forces  and  fluxes 
must  be  introduced.  These  complementary  laws  of  evolution  can  be  conveniently  expressed 
under  the  form  : 

A  =  D,0  (5) 

or  under  the  dual  form  : 

a  =  D*,a  (6) 

where  D  (a,q)  denotes  the  dissipation  potential  ,  a  function  of  a  depending  eventually  on  the 
present  state  q,  D  is  obtained  from  D  by  Legendre  transform.  If  the  dissipation  potential  is 
assumed  to  be  convex,  the  notion  of  differentiation  in  (5)  can  be  understood  in  the  sense  of 
sub-gradients,  cf.  Moreau  [6],  and  this  enables  us  to  include  in  the  same  framework  the  impor¬ 
tant  case  of  non-differentiable  dissipation  potentials  associated  with  time- independent  dissipa¬ 
tive  processes. 

Finally,  the  constitutive  equations  of  a  standard  material  are  described  by  the  expressions  of 
two  potentials  W  and  D. 

This  description  is  sufficiently  large  to  include  most  models  of  anelasticity  : 

1  For  example,  in  Maxwell's  model  of  linear  visco-elasticity,  state  variables  are  (e,rv  ),  free 
energy  W(c,£v  )  =  ^  (e-«v  ).L.(e-«v  )  and  D  =  ^  £v  .N.ev  . 
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If  ev  is  replaced  by  cP  and  D  =  k  j«P  |  while  w  is  unchanged,  relation  (5)  gives  : 
a  =  k  if  £P  #  O  and  Id  <  k  if  eP  =  0 

I*p  I 

The  obtained  model  is  the  elastic-perfectly  plastic  material  with  Mises  criterion  and  normality 
law  : 

f(a)  =  |cr|  -  k  <  O  ,  <p  =  /i  f1(7  with  n  >  0,  f  <  0  ,  /r  f  =  0  . 

In  the  same  spirit,  the  model  can  be  extended  into  finite  strain  with  the  decomposition  F  =  Vu 

*  E.P  (cf.  [7,  11]): 

Mandel's  model  of  perfect  plasticity  consists  in  choosing  t  =  F  and  a  -  P  ,  W  =  W(E)  = 

W(F.P_1)  .  In  this  case,  a  -  W,F  is  the  unsymmetric  Piola-Kirchoff  stress  tensor  and  A  =  -W,p 

=  o.E  .  The  plastic  criterion  is  f (ip)  <0  ,  ip  denotes  the  stress  tensor  ip  =  detP  P.A  and  the 
flow  rule  is  P  =  n  f,A  with  f  =  f  (  detP  P.A  ). 

Standard  models  of  plasticity  have  been  much  discussed  at  small  strain.  Most  models  with 
associated  flow  rules  are  standard  models.  For  example,  Chaboche's  model  of  cyclic  plasticity 
[10]  is  an  extension  of  Ziegler-Prager  or  Mroz's  model  of  combined  isotropic  and  kinematic 
hardening  [12].  Camclay  model  is  a  popular  standard  model  in  soil  mechanics.  The  notion  of 
visco-plastic  potential  D*(cr)  has  been  discussed  in  particular  by  Rice  [8].  Standard  models  of 
damage  mechanics  are  considered  in  [10]  etc.... 

It  is  important  to  underline  that  the  description  of  standard  models  is  stable  with  respect  to 
the  choice  of  state  variables  and  with  the  expression  of  forces  and  fluxes. 

For  example,  if  an  arbitrary  regular  change  of  variable  p  =  p(a)  is  introduced,  the  new  asso¬ 
ciated  force  is  B  =  -  w,^  =  A  .  a,^  while  P  =  P,a  .  6  .  It  is  clear  that  A.a  =  B./I  and  all 
the  principal  ingredients  of  the  model  (free  energy,  forces,  dissipative  potential,  convexity) 
remain  available. 

Forces  A  and  rates  a  can  also  be  transported  by  convection  to  a  different  configuration  to 
new  forces  B  =  L(q).A  and  new  fluxes  b  =  L(q)T  ~l  .a  where  L(q)  denotes  a  state-dependent 
linear  operator.  The  dissipation  potential  D  can  be  then  written  in  terms  of  new  fluxes  as  it  has 
been  proposed  in  different  models  of  plasticity  at  finite  strain. 

If  e  =  A(t)  is  a  given  function  of  time,  the  following  equations  are  obtained  : 

A  =  -  W,a  (A, a)  =  D,a  (a,A,a).  (7) 

Thus  the  evolution  of  internal  parameters  a(t)  satisfies  a  differential  equation  with  initial  condi¬ 
tion  : 

W,a  +  D,a  =  0  ,  a(0)  =  a0  (8) 

which  can  also  be  expressed  in  an  equivalent  way  as  : 

A  =  -  W,a  (A(t),o) 

«  =  D  ,A  (A,A(t),a) 
a(0)  =  a0 


(9) 
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2.2.  Standard  systems 

Equations  (8)  also  describes  the  quasi-static  evolution  of  discrete  or  continuous  structures,  cf. 
Biot  [4], 

For  a  system  defined  by  parameters  q  =  (u,at)  where  u  are  reversible  parameters,  a  irreversi¬ 
ble  parameters,  and  A(t)  are  (displacement  or  force)  control  parameters,  let  W(q,A)  be  the  whole 
free  energy  of  the  system.  This  energy  results  from  the  contribution  of  reversible  energy  stored 
by  all  different  elements. 

If  the  system  is  dissipative,  let  us  introduce  as  in  the  previous  section  the  dissipation  potential 
D(a,q)  in  the  sense  that  : 

d  =  A  .  d  with  A  -  D,^  .  (10) 

The  equation  describing  the  dynamic  evolution  of  the  system  has  then  the  general  form  : 

W  +  D,q  +  F(q,A)  +  J  -  0  (11) 

q(0)  =  q®  ,  q(0)  =  p® 

where  F  denotes  external  forces  and  J  inertia  forces. 

If  the  external  forces  F  are  conservative,  by  definition  F  =  -  'J',q  where  ^(q,A)  denotes 
the  force  potential.  It  is  convenient  to  include  the  force  potential  in  the  expression  of  W 
(which  represents  then  the  total  free  energy  of  the  system)  to  write  the  quasi-static  equation 
of  evolution  in  the  previous  form  (8)  : 

W  +  D,_  *  0  (12) 

q(0)  =  q® 


3.  TIME-INDEPENDENT  PROCESSES 

3.1.  Quasi-static  evolution  of  time-independent  processes 
We  are  interested  principally  in  the  particular  case  of  time-independent  processes.  This  is  an 
important  case  since  it  includes  all  usual  descriptions  of  dry  friction,  plasticity,  fracture  and 
damage. 

Such  phenomena  are  associated  with  positive  homogenous  dissipation  potential  of  degree  one  : 

D(mq,q)  =  m  D(q,q)  if  m  >  0  .  (13) 

This  function  is  not  differentiable  when  q  =  0,  the  set  C  of  sub-gradients  at  this  point  is  a 
convex  set  in  the  force  space  A  and  represents  the  domain  of  admissible  forces.  This  domain  is 
state-dependent  C  =  C(q)  when  D  depends  on  the  present  state  q.  It  can  be  convenient  to  rep¬ 
resent  this  domain  C  by  inequalities  f(A,q)  <  0  . 

Conversely,  D  is  recovered  from  C  by  Hill's  principle  of  maximum  dissipation  : 

D(q,q)  =  Max  A*  q  (14) 

A  €  C 

In  the  spirit  of  the  previous  examples  of  plasticity,  equations  (14)  can  also  be  written  follow¬ 
ing  (1 1)  or  (13)  as  : 
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W,u 
A  = 
a  - 

q  (0) 


=  O  , 

-  W,a  , 
NC(<^  (A)  , 


(15) 


where  Nc  denotes  the  operator  of  normality,  Nc  (A)  is  a  normal  to  convex  C  at  point  A. 

In  (15),  the  first  group  of  equations  W,u  =  0  gives  the  possibility  to  eliminate  completely 
the  reversible  variables  u  as  u  =  u(a,A)  when  W,uu  is  not  singular.  This  yields  a  more  com¬ 
pact  equation  of  evolution  for  a  : 


A  =  - 
d  =  N 
<*(0)  = 


W, 


C(a.A) 
=  a” 


(A) 


where  W  =  W(a,A)  =  W(u(a,A),a,A)  is  the  reduced  expression  of  energy. 


(16) 


3.2.  Incremental  response 

It  is  well-known  that  normality  law  represents  an  incremental  law  : 

For  example,  in  plasticity,  fP  =  p  f,a  with  >  0,  f  <  0  and  pf  =  0  .  Plastic  multiplier  p  is  an 
additional  unknown  but  equality  p  f  =  0  provides  an  additional  equation.  If  r.h.s.  derivability 
is  assumed,  it  follows  that  p  f  +  p  f  =  0  .  Thus  if  the  plastic  criterion  is  satisfied,  i.e.  f  =  0, 
the  plastic  multiplier  p  is  then  given  by  equations  p  >  0,  f<  0  ,pf  =  0  .  This  is  the  usual  way 
to  address  elastic- plastic  response. 

However,  it  is  also  possible  to  express  the  same  result  directly  from  the  dissipation  potential. 
The  following  equivalent  equations  are  derived  from  normality  law  and  from  r.h.s.  derivability 
[13]: 

-  A.d  +  q  .  D,q  (d,q)  =0  (17) 

-  A  .6a  +  q  .  D,q  (6a,q)  >0  (18) 

V  iq  =  (5u,6a)  such  that  6a  is  an  admissible  rate  i.e.  a  vector  of  the  normal  cone  Sa  e  Nc 

(A)  . 

If  the  admissible  domain  C  does  not  depend  on  the  present  state  q,  equation  (17)  gives  the 
orthogonal  condition  A.d  =  0  which  is  for  example  the  well-known  relation  a  eP  =  0  of  perfect 
plasticity  at  small  strains. 

When  A  is  computed  from  A  =  -  W,0  ,  equations  (17)  and  (18)  lead  directly  to  the  follow¬ 
ing  characterization  of  the  rate  response  : 


The  rate  q  =  (u,a)  is  a  solution  of  the  variational  inequality  : 


(  w,uu 

.  u  +  W,ua 

•  «  +  W,uA  .A  ).  6u 

=  o 

(  W,ua 

•  «  +  w,„a 

•  “  +  )-  ( Sa 

-  d)  + 

q  .  (  D,q  (6a,q)  -  D,q  (d,q)  )  >  0 


(19) 


for  all  admissible  rates  6q  =  (Su,  6a). 
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3.3.  Some  general  results  on  the  incremental  response 
The  incremental  response  has  been  studied  in  the  context  of  plasticity  [14-17]...  or  for  contact 
problems  with  dry  friction  [18-19].  Variational  inequality  (19)  can  be  studied  by  standard  meth¬ 
ods  of  convex  analysis  [20].  The  following  remarks  are  straightforward  : 

-  This  variational  inequality  is  not  symmetric  when  D  uepends  on  u.  If  D  depends  on  or,  ii  is 
symmetric  if  and  only  if  the  term  6a  .  D,a  (a,a)  is  symmetric.  The  dependence  of  the  dissipa¬ 
tion  potential  on  the  present  state  is  thus  a  source  of  mathematical  difficulty.  It  is  useful  to 
recall  that  a  non-symmetric  variational  inequality  admits  no  associated  extremum  principle. 

-  Existence  of  a  rate  response  q  associated  with  an  arbitrary  implied  control  rate  A  is  ensured 
by  the  following  positivity  condition  : 

Sq  .  W,qq  .  5q  +  £q  .  D,q  (6a  ,  q)  >  0  (20) 

for  all  admissible  rates  £q  =  (Su.Sa)  . 

-  Uniqueness  of  the  rate  response  is  ensured  if  the  following  inequality  is  satisfied  for  any 
couple  of  admissible  rates  tfq1  *  Sq1  : 

(Sq*-Sq*)  .  W,qq  .  (5qi-Sq»)  +  (5q1-£q*)  .  (  D(Sai,q)  -  D(5a*,q))  >0  (21) 

-  Variational  inequality  (19)  is  a  mixed  formulation  in  which  the  principal  unknown  is  c*  = 
(u,a).  It  may  be  more  convenient  in  some  situations  to  eliminate  u  or  a  and  to  retain  only  a 
or  u  as  principal  unknown  : 

In  plasticity  for  example,  in  the  case  of  simple  plastic  potential,  the  plastic  strain  rate  can  be 
eliminated.  The  resulting  rate  displacement  problem  with  unknown  u  has  been  discussed  in 
detail  [14-17]  in  small  and  finite  strains.  The  associated  expressions  of  (20)  and  (21)  for  an  elas¬ 
tic-plastic  solid  is,  [15]  : 


6a  .  V5u  dV  +  $u  .  .  6u  >  0 

V 


(22) 


|  ^-^.(VSui-VSu2)  dV  +  (5u1-5u2)  .  ¥,uu  .  (fiu^fiu*)  >  0  (23) 

Jv 

for  all  admissible  rates  of  displacement. 

However,  in  the  case  of  multiple  plastic  potential,  the  plastic  criterion  is  represented  by  sev¬ 
eral  inequalities  and  such  elimination  is  not  possible.  It  is  thus  necessary  to  retain  as  principal 
unknown  the  couple  q  =  (u,«). 

Another  possibility  is  to  eliminate  u  and  keep  a  as  principal  unknown  following  the  reduced 
formulation  (16).  Such  formulation  has  been  discussed  by  Maier  [21]  in  plasticity  for  the  study 
of  the  quasi-static  evolution  and  of  the  asymptotic  behaviour  of  elastic-plastic  structures  such 
as  elastic  or  plastic  shake-down.  In  fracture  mechanics,  a  represents  the  crack  surfaces  or  crack 
lengths,  the  elimination  of  displacement  u  is  also  necessary.  The  expressions  of  (20)  or  (21; 
associated  with  the  reduced  formulation  (16)  are  : 


(24) 


6a  .  W,aa  .6 a  +  6a  .  D,a  (Sa,o t,A)  >  0 
(Sa'-Sa*)  .  W,aa  .  (ScH-Sa*)  + 


(25) 
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(6a1 -Sa*)  .  (  D,a  (5a1, a,X)  -  D,„  (6a*,a,A))  >  0 
3.4.  Bifurcation  and  stabi'ity  analysis 

It  has  been  established  by  Hill  [15]  that  positivity  condition  (23)  is  a  sufficient  condition  of 
non-bifurcation.  Bifurcation  analysis  in  plasticity  has  been  extensively  studied  [22-24]  and  most 
analyses  are  based  on  Hill's  result.  It  is  relatively  clear  that  if  two  different  rates  exist  then 
bifurcation  occurs  eventually.  However  the  possibility  of  smooth  bifurcation,  i.e.  two  responses 
with  same  tangent  but  different  curvatures  for  example,  can  also  be  i  scussed  in  the  light  of 
Hill's  criterion  (23)  by  considering  higher  order  terms. 

In  this  spirit,  conditions  (21)  or  (23)  or  (25)  represent  the  same  criterion  of  non-bifurcation. 

Condition  (22)  has  been  proposed  bv  Hill  [15]  as  a  sufficient  condition  of  stability  of  the  pre¬ 
sent  equilibrium  of  ar.  elastic-plastic  system.  It  appears  as  a  direct  generalization  of  the  well- 
known  criterion  of  second  variator)  in  elastic  buckling.  Much  discussions  have  been  devoted  for 
a  full  justification  of  Hill's  result,  cf.  [13,  25,  26]  in  connection  with  the  notion  of  dynamic 
stability  i.e.  stability  in  the  sense  of  Lyapunov. 

An  equilibrium  is  stable  if  small  perturbations  of  the  system  imply  only  small  perturbed 
motion  near  equilibrium.  The  energy  Ep  injected  into  the  system  by  perturbation  forces  during 
time  interval  [0,t]  is  : 


Ep  t,t)  =  K(t)  +  4>(t)  -  $(0)  with 

*(t)  =  W(q,A)  +  f  D(a,q)  dt 
Jt° 


(26) 


where  K(t)  >  0  denotes  the  kinetic  energy  and  $(t)  the  total  energy  which  is  the  sum  of  total 
free  energy  and  dissipated  energy  of  the  system  at  time  t.  In  general,  the  total  energy  is  a  path- 
dependent  functional. 

It  has  been  established  that  [26]  : 


$(t)  -  *(0) 


i  M  £  * 


higher  order  terms... 


(27) 


with  5*$ 


a  Vu  dV  +  u  .  ^,uu  .  u 
V 


Thus,  if  (22)  is  satisfied.  fhen  it  will  be  necessary  to  inject  energy  to  remove  the  system  from 
equilibrium.  The  system  will  have  no  natural  tendency  to  change  itself  without  external  actions. 
The  considered  equilibrium  satisfies  certain  notions  of  stability  qualified  as  static  stability  or 
directional  stability  [13,  19,  26]. 


To  obtain  a  complete  justification  of  dynamic  stability,  i;  is  necessary  to  derive  further  esti¬ 
mates  on  the  perturbed  motions.  References  [13,  25]  provide  some  discussions  on  »his  difficult 
subject.  In  particular,  when  dissipation  potential  D  is  state-dependent,  the  loss  of  symmetry  in 
the  rate  response  is  an  important  difficulty  and  the  problem  is  still  open  to  further  inve  liga¬ 
tions.  However,  if  D  is  state-independent,  then  dynamic  stability  is  also  ensured,  at  least  for 
discrete  systems. 

In  the  same  spirit,  conditions  (20)  or  (22)  or  (24)  give  the  same  criterion  of  (static  or 
directional)  stability.  When  D  is  state-independent,  the  considered  equilibrium  is  also  dynami- 


cally  stable,  at  least  for  discrete  systems.  Stability  criterion  (20)  or  (24)  has  been  discussed  for 
various  problems  of  plasticity,  friction,  damage,  fracture  [27,  28], 

Note  that  if  D  is  state-independent,  then  stability  criterion  (24)  is  : 

SA  .  Sa  >  0  V  6a  admissible,  where  6 A  =  W,aa  .  6a.  (28) 

It  is  important  to  underline  that  bifurcation  does  not  mean  exchange  of  stability  as  in  the 
classical  context  of  elasticity,  cf.  [29,  30],  This  result  is  well  understood  after  Shanley's  discus¬ 
sion  on  the  buckling  of  elastic-plastic  columns  (Fig.  1)  [13,  28).  Condition  (20)  for  example  is 
less  restrictive  then  (21). 


4.  ILLUSTRATIONS 

4.1  System  of  interacting  linear  cracks 

The  study  of  crack  propagation  and  stability  is  a  classical  problem  in  Fracture  Mechanics.  In 
brittle  fracture,  when  Griffith's  criterion  for  c  ack  propagation  is  assumed,  a  large  body  of 
research  work  has  been  dedicated  to  the  two-dimensional  problem  of  one  linear  crack  in  plane 
strain,  plane  stress  or  antiplane  shear.  Its  generalization  to  a  system  of  interacting  linear  cracks 
or  a  plane  crack  of  arbitrary  shape  has  been  also  discussed. 

An  elastic  solid  with  Griffith  cracks  is  a  system  undergoing  irreversible  transformation  subse¬ 
quent  to  crack  propagation.  Its  quasi-static  evolution  fits  perfectly  in  the  framework  of  time- 
independent  standard  dissipative  systems.  For  example,  the  quasi-static  behaviour  of  a  system 
of  interacting  linear  cracks  can  be  compared  to  the  quasi-static  behaviour  of  a  descrete  elastic 
plastic  structure.  The  equations  of  evolution  of  these  systems  are  of  the  same  mathematical 
nature. 


Consider  the  following  crack  propagation  problem  in  brittle  fracture,  [28,  32-34).  An  elastic 
solid  V  with  m  linear  cracks  of  lengths  t,  is  subjected  to  a  loading  path  defined  by  load  param- 
e  ers  A.  The  quasi-static  response  l(t)  associated  with  a  given  loading  path  A(t)  from  a  given 
initial  configuration  Iq  is  discussed.  The  cracks  are  assumed  to  propagate  in  their  direction  (no 
crack  kinking).  The  response  of  the  solid  is  reversible  if  there  is  no  crack  propagation.  The 
crack  lengths  t( t)  represent  clearly  a  system  of  geometric  parameters  describing  the  irreversible 
evolution  of  this  system.  It  is  thus  natural  to  take  l  as  parameters  a. 


Total  free  energy  of  the  system  is  : 


W(u,f,A) 


W(t(u))  dV  -  Td  u  dS 

Js-r 


Displacement  at  equilibrium  u  -  u(f,A)  can  be  eliminated  and  leads  to  the  definition  of  total 
free  energy  at  equilibrium  W(f,A)  =  W(u(f,A),f,A). 


Generalized  force  A  =  -  —  represents  the  energy  release  rate  G.  associated  to  the  i-th 

aei 

crack  length. 

In  brittle  fracture,  the  Griffith  criterion  consists  in  adopting  the  following  crack  propagation 
law  : 


If  G,  <  Gc  then  f,  =  0  (no  propagation) 

If  C,  =  Gc  then  f,  >  0  (possible  propagation) 


where  Gc  denotes  a  critical  surface  energy.  Most  often,  Gc  is  a  constant  of  the  considered  mat¬ 
erial.  In  some  applications  however,  Gc  may  also  be  considered  as  a  function  of  the  effective 
length  of  propagation  A£,  =  tx  -  £,°  in  order  to  take  into  account  the  stabilizing  effect  due  to 
the  near-tip  plastic  strain.  The  associated  dissipation  potential  is  : 

D('t,l)  =  Gc  (A lt  )ir  if  ir  >  0  ,  =  +  oo  otherwise 

I 

where  I  denotes  the  set  of  possible  propagating  cracks  i.e.  such  that  A,  =  Gc  (A^  ). 


The  rate  problem  is  now  described  by  variational  inequality  : 

^  >  0  ,  ie  I  and  satisfy  : 

(29) 

+  W,iA  .  X)(  Sti  -  ^  )  >  0 

where  =  W,jj  +  G',.  6^  . 

The  present  state  (2,A)  is  stable  in  the  dynamic  sense  if  the  following  condition  is  satisfied  : 

.  $5ij  .  5fj  >  0  V  St  *  0  ,  Si{  >  0  ,ie  1  (30) 

The  present  state  (/,A)  is  not  an  angular  bifurcation  state  if  : 

Matrix  <5,^  ,  (i,j  e  I)  is  positive-definite.  (31) 

If  Gc  is  a  constant,  the  stability  criterion  is  : 

-  SC  .  SI  >  0  for  any  SI  *  O,  >  O  ,  i  €  I  (32) 

where  SG  =  -  W ,n  .  SI  . 

For  example.  Fig.  2  represents  an  elementary  system  of  two  cracks  simulating  the  delamina¬ 
tion  of  laminated  composite  [35-38].  The  analytical  solution  can  be  constructed  and  compared  to 
experimental  results  [35],  It  may  be  interesting  to  compare  also  this  system  to  previous  Shanley's 
model  of  elastic-plastic  column  of  Fig.  1  . 

4.2  Propagation  of  a  plane  crack  or  a  damaged  zone 
The  propagation  of  a  plane  crack  of  arbitrary  shape  0  in  a  three-dimensional  solid  is  an 
important  problem  in  fracture  mechanics.  The  study  of  interface  cracks  in  thin  films  or  lami¬ 
nate  composites  provide  in  the  same  spirit  similar  examples  of  plane  cracks  of  arbitrary  shape. 
[35-40].  In  damage  mechanics,  some  models  of  brittle  damage  also  lead  to  the  extension  of  a 
damaged  zone,  [41-46], 


These  examples  are  here  considered  in  order  to  illustrate  general  results  when  the  irreversible 
parameter  a  is  a  plane  surface  ft  of  boundary  S  ,  cf.  Fig.  3.  The  solid  is  assumed  to  be  elastic, 

the  potential  energy  at  equilibrium  is  W(n,A)  =  W(u(fi,A),n,A)  with  : 

* 

W(u,n,A)  -  W(<(u))  dV  -  T(A).u  ds 

JV  JSf 

To  follow  the  genera!  description,  it  is  necessary  to  make  the  derivation  W,n  and  W,nn  by 

techniques  of  derivation  with  respect  to  a  domain.  It  is  established  that  : 
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W,Q  .  fft  -  -  G  5n  ds  (33) 

h 

where  G  denotes  the  energy  release  rate  at  a  point  of  the  moving  surface  S  and  represents  the 
local  value  of  generalized  force  A  associated  with  the  motion  of  O. 

For  a  plane  crack  in  a  three-dimensional  solid,  G  is  the  limiting  value  of  local  Rice-Eshelbv 
integrals  : 

G  =  J°  with  jo  =  Lim  j  (Wnx  -  n.o.u,n  )  dr  (34) 

r  =>o 

For  a  damage  zone,  the  expression  of  G  is  : 

G  =  [W  -  n.<7.u,n  ]  (35) 

For  a  delamination  crack  in  a  composite  plate  or  a  thin  film  : 

G  =  [W  -  n.N.u,n  -  n.M.Ww.n]  (36) 

The  second  derivative  of  energy  is  : 

sn  .  W,nn  .  $n  =  |  -  |  6G  +  ^  J  .  SCI  ds  (37) 

where  8G  denotes  the  variation  of  G  following  the  motion  6f2  of  n  and  R  is  the  curvature  of  S. 
A  symmetric  expression  of  the  second  derivative  can  be  obtained  from  the  expression  of  SG  in 
terms  of  5u(6fl),  cf.  [13].  The  boundary  perturbation  problem  to  obtain  6u(6ft)  is  a  familiar 
problem  of  shape  optimization. 

The  propagation  law  : 

If  G(s)  <  Gc  then  fi(s)  *  0  (no  propagation)  (38) 

If  G(s)  =  Gc  then  0(s)  >  0  (possible  propagation) 

is  associated  with  dissipation  potential : 

D(0,n)  =  Gc  fXs)  ds  for  IXs)  >  0  (39) 

JS 

When  Gc  is  a  constant,  the  total  energy  depends  only  on  the  present  state  : 

*(n.  A)  =  W(n,A)  +  Gc  dV  (40) 

Jn 

The  rate  problem  of  propagation  of  the  damaged  zone  Q  is  given  by  variational  inequality 
(19)  which  can  be  now  written  as  : 


Q  (s)  >  0  on  Sc  and  satisfies  V  6fl(s)  >  O  on  Sc  : 

(sn  -  ii) .  (*,nn  .  n  +  w,nA  .  x)  >  c 

where  Se  denotes  the  portion  of  S  such  that  G(s)  =  Gc 


(41) 
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Stability  and  bifurcation  again  can  be  discussed  as  in  previous  chapters.  For  example,  the  fol¬ 
lowing  proposition  is  obtained  : 

The  present  equilibrium  is  stable  in  the  dynamic  sense  if  the  quadratic  form  50  .  <&,nn  • 
is  positive  definite  on  the  set  of  admissible  rates  £0(s)  >  0  on  Sc  . 

The  stability  criterion  can  also  be  written  as  : 

-  SG  .  50  ds  >  C  for  any  50  #  O  such  that  60(s)  >  0  on  Sc  .  (42) 

Js 

As  an  illustration,  consider  the  torsion  of  a  brittle  elastic  cylinder  [46]  Fig.  4.  If  t  is  the  length 
of  the  cylinder  and  XI  the  torsion  angle  between  the  two  extreme  sections,  the  displacement  u  is 
given  by  : 


ux  =  -Ayz  ,  Uy  =  A  xz  ,  ut  =  A  rtfx,  y) 

where  rj{x, y)  denotes  the  warping  function.  The  expression  of  energy  is  : 


W(rj,0,A)  =  A2  id 


V-0 


[(*7,x  -  y)2  +  (n,y  +  x)2  ]  dv 


Equilibrium  equation  W>f?  =  0  leads  to 


At?  =  0  in  V-0  ,  =  nx  y  -  ny  x  on  S 


0. 


For  example,  for  a  cylinder  of  circular  section,  these  equations  give  the  trivial  equilibrium  r>  = 


The  dependence  dr?(dO)  can  be  obtained  directly  from  the  derivation  of  the  previous  equations 
with  respect  to  0.  The  following  equations  are  obtained  for  a  circular  section  : 


A(drj)  =  0  in  V-0  ,  »  R  dO,8 

on 


oo 

Let  dO(5)  =  da0  +  (  daj  cos  j&  +  dbj  sin  j 9  ) 

j=l 

be  the  Fourier  expansion  of  dO.  The  following  result  is  obtained  : 


dO  .  $,nn  .dO  =  2  A2  nt  2ttR2  J 


da02  -  ^  (j-!)  [  da}2  +  db^ 

j=l 


This  form  is  not  positive  definite  on  the  set  {  daj  ,  j  =  0,1,..  )  such  that  the  rate  dn(f?)  is  non¬ 
negative.  Thus,  trivial  equilibrium  is  not  stable. 
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Figure  1.  Plastic  buckling 
of  Shanley's  column  (a). 
Bifurcation  diagrams  are  given 
by  force-displacement  curves  (b) 
or  by  plastic  strain  curves  (c). 


Figure  2.  An  elementary  system  simulating 
delamination  of  composites. 


Figure  3.  Propagation  of  a  plane  crack 
of  arbitrary  shape  : 

(a)  in  a  three-dimensional  solid, 

(b)  in  the  interfaces  of  multilayered 
composites. 


Figure  4.  Torsion  of  a  brittle  elastic 
circular  cylinder.  Trivial  response  is 
instable  under  force  or  displacement  control 
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Abstract 

A  descriptive  treatment  of  light  emission  from  pulsating  bubbles  (sonoluminescence), 
sound  generation  by  rain  falling  on  water  and  breaking  waves,  and  the  propagation  of 
shock  waves  in  bubbly  liquids  is  presented.  The  first  section  contains  a  brief  survey  of 
some  occurrences  of  bubbles  in  literature  and  in  the  figurative  arts.  Considerations  on 
the  etymology  of  the  word  are  presented  in  the  appendix. 


1.  INTRODUCTION 

UoptfoXvf  o  ai/QpoTTos  -  Man  is  a  bubble  -  is  an  ancient  Greek  proverb  that  enjoyed  a 
singular  favor  in  the  western  culture  as  an  expression  of  the  caducity  and  impermanence 
of  human  life.  It  is  used  by  a  number  of  Latin  and  Greek  authors  such  as  Varro  who, 
apologizing  in  the  Preface  to  his  De  Re  Rustica  (36  b.c.)  that  the  work  is  not  as  polished 
as  he  would  like,  says  that  nevertheless,  being  80  years  old,  he  should  go  ahead  with  its 
publication  since,  “ut  dicitur,  si  est  homo  bulla,  eo  magis  senex”  (“if,  as  they  say,  man 
is  a  bubble,  all  the  more  so  is  an  old  man”).  Petronius  (1st  century  a.d.),  in  a  mocking 
passage  of  the  Satyricon  in  which  he  compares  man  to  inflated  walking  bags  and  flies,  also 
says  “nos  non  pluris  sumus  quam  bullis”  (“we  are  no  more  than  bubbles,”  42,  4).  Lucian 
(117-180)  elaborates:  “I’ve  thought  of  a  simile  to  describe  human  life  as  a  whole  . . .  You 
know  the  bubbles  that  rise  to  the  surface  below  a  waterfall  -  those  little  pockets  of  air 
that  combine  to  produce  foam?  . . .  Well,  that’s  what  human  beings  are  like.  They’re 
more  or  less  inflated  pockets  of  air  ...  but  sooner  or  later  they’re  all  bound  to  go  pop” 
( Charon  19). 

The  advent  of  Christianity,  with  its  message  of  hope  and  salvation,  rendered  the  idea 
less  relevant  and  probably  the  only  p re- Renaissance  textual  reference  to  it  is  in  the  Lexicon 
by  the  10th- 11th  century  Byzantine  scholar  Suida  (“as  a  bubble  immediately  disappears 
when  it  is  broken,  so  does  the  memory  of  the  splendid  and  powerful  upon  their  death”). 

However,  in  the  northern  post-reformation  cultural  climate  of  the  16th  and  17th  cen¬ 
tury,  the  metaphor  regained  its  appeal.  Most  succint,  -  and  first  -  is  Erasmus:  “Homo 
bulla”  [man  (is  a)  bubble,  Adagia,  1508,  n.  1990],  Taverner,  in  his  Proverbs  or  ada- 
gies  . . .  gathered  out  of  the  Chiliades  of  Erasmus  (1539)  echoes  the  concept,  and  so  does 
sir  Thomas  Elyot  (1545),  Golding  (“When  man  seemeth  to  bee  at  his  best,  he  is  al- 


Figure  1:  Engraving  after  a  1575  painting  by  Cornelis  Kete]  illustrating  the  Greek  proverb 
“Man  is  a  bubble.”  (Reproduced  from  Ref.  2) 


togither  nothing  else  but  a  bubble  blowen  togither  of  vanitie”  Psalms  xxxix  6,  II  155, 
1571),  Samuell  Rowlands  (“Poore  mortals  in  my  hands  are  brittle  ware,  Like  Vapor,  Bub¬ 
ble,  Flower,  wither’d  Hay,”  A  Terrible  Battell  betweene  the  two  consumers  of  the  whole 
World:  Time,  and  Death ,  p.  23,  1606)  Sir  William  Browne  (“What’s  he  . . .  whose  life’s 
a  bubble,  and  in  length  a  span;  . . .?  ’Tis  a  man,”  Britannia’s  Pastorals ,  Bk.  i,  song  2,  1. 
192,  1613),  Simon  Wasted  (circa  1628),  Jeremy  Taylor  (1651),  and  several  others. 

Sir  Francis  Bacon,  in  his  The  World  (1629)  expresses  a  more  radical  point  of  view. 
“The  world’s  a  bubble,”  which  Francis  Quarles  quickly  echoes  “My  soul,  what’s  lighter 
than  a  feather?  ...  This  bubble  world.  What  then  this  bubble?  Nought”  ( Emblems ,  Bk. 
i,  No.  4,  published  in  1635),  and  also  “The  pleasure,  honour,  wealth  of  sea  and  land  / 
Bring  but  trouble;  /  The  world  itself,  and  all  the  world’s  command  /  Is  but  a  bubble” 
(ibid.,  Bk.  i,  No.  6). 

Of  the  9  times  the  word  “bubble”  recurs  in  Shakespeare’s  theatre  (it  is  never  used 
in  his  poetry),  3  times  it  similarly  suggests  volatility  and  insubstantiality  such  as  “the 
bubble  of  reputation”  (/Is  You  Like  It  II,  vii,  139),  or  “A  dream  of  what  thou  wast  ...  a 
breath,  a  bubble”  ( Richard  III  IV,  iv,  83). 

The  same  notion  accompanies  the  word  throughout  the  subsequent  English  language 
literature.  William  Cowper  ( The  Task ,  published  in  1785,  Bk.  Ill,  1.  175)  admonishes: 
“Eternity  for  bubbles  proves,  at  last,  /  A  senseless  bargain.”  Samuel  Johnson,  in  an  ironic 
passage  thus  commiserates  the  fashion-conscious:  “Hard  is  his  lot,  that  here  by  fortune 
plac’d  /  must  . . .  chase  the  new-blown  bubbles  of  the  day.”  Alexander  Pope  is  most 
impartial:  “Who  sees  with  equal  eye,  as  God  of  all,  /  a  hero  perish,  or  a  sparrow  fall, 
/  atoms  or  systems  into  ruin  hurled,  /  and  now  a  bubble  burst,  and  now  a  world”  (An 
Essay  on  Man ,  1734,  Epis.  i,  1.  87)  arid  also:  “Like  bubbles  on  the  sea  of  matter  borne,/ 
They  rise,  they  break,  and  to  the  sea  return”  (ibid.  Epis.  iii,  1.  19).  Goleridge  thus  uses 
the  metaphor  to  express  a  lover’s  disappointment  “I  listen  for  thy  voice  /  Beloved!  ’t  is 
not  thyne;  thou  are  not  here!  Then  melts  the  bubble  into  idle  air,/  and  wishing  without 
hope  I  restlessly  despair”  (Blossoming  of  the  Solitary  Date-Tree,  st.  4).  His  contemporary 
Sir  Walter  Scott  joins  in  the  lover’s  desperation:  “Like  the  dew  on  the  mountain,/  Like 


357 


the  foam  on  the  river,/  Like  the  bubble  on  the  fountain,  /  Thou  art  gone,  and  for  ever!” 
( The  Lady  of  the  Lake ,  Canto  iii,  st.  16,  1810).  The  idea  can  be  followed  in  Lord  Byron: 
“The  eternal  surge  of  time  and  tide  rolls  on,  and  bears  afar  our  bubbles”  (Don  Juan , 
1824,  canto  xv,  st.  99),  and  all  the  way  down  practically  to  modern  times. 

It  does  not  seem  that  English  language  poets  ever  found  any  word  other  than  trouble 
to  rhyme  with  bubble.  An  early  example  by  Quarles  was  cited  before.  Shakespeare’s  is 
probably  the  most  well  known  one:  “Double,  double  toil  and  trouble,  /  fire  burn  and 
cauldron  bubble”  ( Macbeth  IV,  i,  10).  One  can  also  add  Dryden:  “War,  he  sung,  is  toil 
and  trouble;/  honor  but  an  empty  bubble”  (Alexander’s  Feast,  or  the  Power  of  Music 
1.  97),  the  Australian  poet  Adam  L.  Gordon:  “Life  is  mostly  froth  and  bubble;  /  Two 
things  stand  like  stone:  /  Kindness  in  another’s  trouble,  /  Courage  in  your  own”  (  Ye 
Weary  Wayfarer,  Fytte  viii),  and  even  the  peripheral  Nathaniel  Ward,  who  combines  it 
with  a  blatant  expression  of  anti-feminism:  “The  world  is  full  of  care,  and  much  like  unto 
a  bubble;/  Women  and  care  and  care  and  women,  and  women  and  care  and  trouble”  (  The 
Simple  Cobbler  of  Aggawam,  p.  25). 

While  the  metaphor  of  bubble  as  volatility  is  present  in  other  more  or  less  unrelated 
traditions,  such  as  in  the  12th-  13th  century  Japanese  writer  Kamo  no  Chomei  (“The  flow 
of  the  river  is  ceaseless  and  its  water  never  the  same.  The  bubbles  that  float  in  the  pools, 
now  vanishing,  now  forming,  are  not  of  tong  duration:  so  in  the  world  are  man  and  his 
dwellings”  Hojoki  -  An  Account  of  my  Hut ,  1212),  or  the  11th  century  Persian  poet  Omar 
Khayyam  (“A  deathless  Saki  draws  Khayyams  in  thousands/  like  wine  bubbles  out  of 
Creator’s  bowl,”  The  Rubaiyyat  st.  49,  transl.  Graves),  it  is  curiously  absent  from  other 
major  literatures.  The  closest  the  Bible  gets  to  the  idea  is  in  Proverbs,  21:  6  with  the 
words  hevel  niddaf  -  literally  “driven  vapor.”  The  New  American  Bible  translates:  “He 
who  makes  a  fortune  by  a  lying  tongue  is  chasing  a  bubble  over  deadly  snares,”  but  most 
other  English  versions  use  different  constructs.  For  example,  the  King  James  Bible  says: 
“The  getting  of  treasures  by  a  lying  tongue  is  a  vanity  tossed  to  and  fro  of  them  that  seek 
death.”  The  Septuagint  translation  is  far  from  literal,  with  the  words  OijattvpiapaTa 
paTOtta,  “vain  possessions.” 

Dante’s  Divina  Commedia  abounds  with  literal  bubbles.  The  violent  rulers  are  pun¬ 
ished  by  boiling  in  blood:  “lungo  la  proda  del  bollor  vermiglio,  /  dove  i  bolliti  facieno  alte 
strida”  [along  the  brink  of  the  vermilion  boiling,  /  wherein  the  boiled  were  uttering  loud 
laments  (trad.  Longfellow),  Inferno  xii,  1.  101]  and  the  barrators  (i.e.,  corrupt  officials)  by 
boiling  in  pitch:  “bollia  la  giuso  una  pegola  spessa,  /  die  ’nviscava  la  ripa  d’ogni  parte./ 
F  vedea  lei,  raa  non  vedea  in  essa  /  rnai  the  le  bolle  che  ’1  bollor  levava,  /  e  gonfiarsi  tutta 
e  risiedcr  compressa”  (was  boiling  down  below  there  a  dense  pitch  /  which  upon  every 
side  the  bank  belimed.  /  I  saw  it,  but  I  did  not  see  within  it  /  aught  but  the  bubbles  that 
the  boiling  raised,  /  arid  all  swell  up  and  resubside  compressed,  Inferno  xxi,  1.  17),  but 
the  Homo  bulla  idea  is  absent.  Similarly,  1  have  been  unable  to  find  any  reference  to  this 
notion  in  other  Italian  poets  or  in  the  French  or  Spanish  literature. 

An  interesting  and  direct  offshoot  of  Erasmus’s  exhumation  of  the  Greek  proverb  can 
however  be  found  in  the  Netherlands.  The  Remonstrant  minister  Uytenbogaert,  admon¬ 
ished  the  Stadholder’s  court  in  the  by  now  familiar  terms:  “Man  is  nothing  other  than  a 
bubble  that  children  blow  in  a  mussel  shell,  that  glistens  in  the  round  . . .  but  which  in 
an  instant  disappears  and  is  gone.”  Rembrandt  painted  a  portrait  of  Uytenbogaert  (sold 
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in  July  1992  at  Sotheby’s  for  3.8xl06  pounds),  and  one  of  Rembrandt’s  better  pupils, 
Gerrit  Don,  contributed  to  the  tremendously  successful  iconography  of  a  child  blowing  a 
soap  bubble,  “an  image  that  reflected  both  levity  and  gravity,  that  floated  like  childish 
reveries  and  popped  as  childhood  itself  had  to  end”  [1],  Many  other  Dutch  painters,  such 
as  Cornelis  Ketel  (perhaps  the  earliest  one,  on  the  back  of  a  1575  portrait,  see  Fig.  1  [2]), 
Hendrik  Goltzius,  Jacques  de  Gheyn,  Jan  Miense  Molenaer,  and  Frans  Hals  have  left  us 
a  number  of  images  of  children  blowing  bubbles.  It  is  interesting  that,  while  the  ancient 
authors  had  in  mind  the  bubbles  briefly  floating  on  water  surfaces,  from  Ketel  onward 
the  reference  is  always  to  soap  bubbles.  This  fact  is  probably  also  due  to  the  dramatic 
improvements  in  the  soap  technology  intervened  since  antiquity. 

In  the  mid  1730’s,  the  French  painter  Jean-Baptiste  Simeon  Chardin  used  his  five-year 
old  son  as  a  model  for  several  pictures  of  a  child  blowing  bubbles  and  later  again  an 
older  boy.  By  this  time,  however,  the  original  Calvinist  moral  embodied  in  the  Vanitas 
genre  had  faded  into  the  background  leaving  its  place  to  the  painter’s  delight  with  his 
subject  and  his  own  ability  as  we  find  in  several  others  artists  all  the  way  down  to  the 
pre-Raffaellite  Sir  John  Everett  Millais,  whose  painting  “Bubbles”  was  used  in  some  of 
the  Lever  Brothers’  commercial  advertising  for  Pear’s  Soap  [1], 

Other  than  its  career  as  a  lofty  metaphor,  the  word  bubble  has  also  enjoyed  another  - 
and  disreputable  -  one.  With  reference  to  the  bulla ,  a  spherical  gold  ornament  worn  by 
the  noble  Roman  children  after  an  Etruscan  custom,  Juvenal  uses  the  word  to  fashion  an 
insult:  “senior,  bulla  dignissime”  (“Old  man,  worthy  of  a  bubble,”  Satirae  13,  1.  33),  and 
in  old  Italian  holla  acquaiola  (“water  bubble”)  was  a  disparaging  designation.  In  Latin, 
animam  cbullire  (“to  gurgle  one’s  life  out,”  similar the  bubbles  that  gurgle  in  the  neck 
of  a  flask  when  the  liquid  is  poured  out)  was  a  vulgar  expression  for  “to  die”  used,  among 
others,  by  Seneca  and  Persius. 

In  Shakespeare’s  All’s  Well  That  Ends  Well  the  Second  Lord  Dumaine  qualifies  Paroles 
as  “On  my  life,  my  lord,  a  bubble”  (III,  vi,  6)  and  goes  on  to  explain:  “He’s  a  most  notable 
coward,  an  infinite  and  endless  liar,  an  hourly  promise-breaker,  the  owner  of  no  one  good 
quality  worthy  of  your  lordship’s  entertainment”  (ibid.,  10-12).  Later,  in  17th  and  18th 
century  English,  a  bubble  denoted  a  “dupe”  or  gullible  person.  In  the  underworld’s  slang, 
“to  betray”  was  indicated  as  to  bubble ,  with  bubble  also  standing  for  a  piece  of  information. 
Bubble  duster  was  used  for  “handkerchief”  (presumably  after  the  Scottish  usage  of  bubble 
for  “mucus,”  whence  also  bubbly  for  “snotty,  dirty”),  and  we  have  the  plethora  of  American 
slang  expressions  bubble  brain,  bubble  head ,  bubble  gurnmer  ( for  “teen-ager”),  bubble  dancer 
(for  “dish  washer,”  in  addition  to  the  original  reference  to  a  night-club  performer),  and 
others.  As  a  symbol  for  financial  bust,  one  does  not  have  to  wait  until  the  recent  crash  of 
the  Tokyo  stock  exchange  to  encounter  the  word.  The  South  Seas  Bubble  was  a  scheme 
devised  in  England  in  1711  to  eliminate  the  national  debt  -  at  the  expense  of  the  usual 
“bubbles”  -  and  the  French  were  quick  to  catch  on  with  their  similar  1717  Mississippi 
Bubble. 

In  French  faire  des  bulles  signifies  something  like  “to  make  a  fuss”  and  roincer  la  bulle 
or  buller  is  a  popular  expression  for  “doing  nothing.”  In  Dutch,  blaasjcs  verkopon  and 
the  corresponding  English  to  blow  bubbles  similarly  stand  for  “to  devise  baseless  theories, 
to  amuse  oneself  in  a  childish  manner,”  and  holla  di  vento  (“wind  bubble”)  recurs  in 
18th-century  Italian  in  a  similar  sense. 
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It  will  be  my  task  in  the  rest  of  this  paper  to  try  to  convince  the  reader  that,  in  spite 
of  all  of  the  above,  bubbles  are  “investigatione  dignissimae”  -  most  worthy  of  serious 
scientific  study. 

It  should  be  stressed  at  the  outset  that  the  type  of  bubbles  that  will  form  the  object  of 
this  paper  differ  in  a  major  way  from  those  featured  in  the  previous  quotations.  Although 
the  bursting  of  floating  bubbles  and  the  related  problem  of  the  stability  of  foams  is  a 
major  scientific  and  technical  problem,  here  I  shall  confine  myself  to  gas  bubbles  more 
deeply  submerged  in  a  liquid. 

The  breadth  of  the  subject  prevents  me  from  any  attempt  at  exhaustiveness.  I  will 
limit  myself  to  a  few  recent  developments  that,  hopefully,  will  give  an  idea  of  the  vast 
range  and  far-reaching  implications  of  bubble  phenomena. 

2.  SONOLUMINESCENCE 

From  the  mathematical  point  of  view,  the  richness  exhibited  by  bubble  phenomena 
may  be  considered  as  a  direct  consequence  of  the  1/r  singularity  of  the  fundamental 
solution  of  Laplace’s  equation  in  three-dimensional  space.  Since  bubble  dimensions  are 
often  “small”  (with  respect  to  other,  problem-dependent  lengths),  this  singular  behavior 
has  the  opportunity  to  manifest  itself  very  strongly.  With  bubbles,  one’s  life  is  enriched 
by  the  proximity  to  this  singularity. 

A  very  striking  phenomenon  due  precisely  to  this  fact  is  sonoluminescence ,  the  light 
emitted  by  a  sound-irradiated  liquid.  In  an  acoustic  wave  the  energy  density  is  of  the 
order  of  £  =  p'2/pc2,  where  p'  is  the  pressure  amplitude,  p  the  fluid  density,  and  c  the 
speed  of  sound.  Since  the  mole  density  per  unit  volume  is  pjM.  where  M  is  the  molecular 
mass,  the  corresponding  energy  per  molecule  is  (M / p)£ / N a,  where  Na  is  Avogadro’s 
number.  For  a  fairly  intense  pressure  wave  in  water  water,  with  p'  ~  1  bar,  this  amounts 
to  approximately  10-9  eV  per  molecule,  which  is  of  the  order  of  10-9  times  smaller  than 
the  energy  of  low-cV  radiation.  The  appearance  of  such  radiation  therefore  implies  that 
a  mechanism  exists  to  amplify  the  energy  density  by  a  factor  of  109. 

First  discovered  in  the  late  20’s,  the  effect  prompted  a  number  of  more  or  less  fanciful 
speculations  as  to  its  origin  which  subsequent  research  has  put  to  rest  -  to  some  extent 
[3J.  The  current  understanding  is  that  sonoluminescence  is  due  to  the  dissociation  and 
recombination  of  gaseous  molecules  contained  in  bubbles  that  pulsate  in  a  strongly  non¬ 
linear  fashion  under  the  action  of  the  sound  field.  It  is  the  1/r  singularity  of  the  potential 
for  the  associated  radial  motion  of  the  liquid  that  is  responsible  for  the  effect. 

A  theory  of  radial  bubble  pulsations  and  sonoluminescence  is  given  elsewhere  [4].  For 
purposes  of  illustration  I  reproduce  here  in  Fig.  2  a  graph  of  the  internal  pressure  (solid 
line)  and  center  temperature  (dashed  line)  for  an  argon  bubble  in  water  subject  to  a  0.93 
bar  sound  field.  The  bubble  radius  is  26  pm  and  the  sound  frequency  21  kHz.  Since  the 
radius  of  a  resonant  bubble  at  this  frequency  would  be  150  pm,  the  bubble  is  driven  at 
about  one  fifth  of  its  resonance  frequency.  A  parallel  chemical  kinetics  calculation  [4], 
the  results  of  which  are  shown  in  Fig.  3,  indicates  that  a  substantial  number  of  hydroxyl 
radicals  are  produced  in  these  conditions  by  collisional  dissociation  of  the  vapor  molecules. 
This  prediction  is  in  agreement  with  the  experimental  fact  that  liquids  saturated  with 
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Figure  2:  Pressure  (dashed  line)  and  center  temperature  (solid  line)  during  steady  oscil¬ 
lations  of  a  26  /im-radius  argon  bubble  driven  at  21  kHz  and  0.93  bar  acoustic  pressure 
amplitude 

Figure  3:  Number  of  hydroxyl  radicals  as  a  function  of  time  during  the  steady  oscillations 
of  the  argon  bubble  of  the  previous  figure  (so  3  line),  and  an  equal  one  containing  neon 
(dashed  line). 

noble  gases  emit  a  strong  310  nm-line  corresponding  to  a  transition  of  the  excited  OH 
radical  to  its  ground  state  [5].  Heat  transport  inside  the  bubble  has  a  strong  effect  on  this 
process.  Neon  I  as  a  thermal  conductivity  3-4  times  that  of  argon,  and  the  corresponding 
OH  number,  shown  in  Fig.  3  by  the  dashed  line,  is  much  smaller  than  for  the  argon 
case.  It  should  be  realized  that,  during  the  period  of  peak  temperature,  the  temperature 
gradient  inside  the  bubble  is  of  the  order  of  10s  K/mm.  Clearly,  this  is  only  possible  due 
to  the  very  short  duration  of  the  peak  temperature,  for  this  example  of  the  order  of  300 
nsec. 

However  short,  this  lime  interval  is  many  orders  of  magnitude  longer  than  the  duration 
of  the  sonoluminescence  light  flashes,  which  is  currently  estimated  below  50  psec  [6).  This 
extremely  brief  duration  is  perhaps  the  most  baffling  aspect  of  sonoluminescence.  A 
possible  explanation  could  be  the  following.  It  is  well  known  since  Wood’s  experiments 
in  1904  that  gases  are  very  opaque  to  radiation  corresponding  to  their  emission  bands.  It 
is  for  this  reason,  for  example,  that  we  can  only  observe  the  outermost  layer  of  the  sun 
as  the  light  emitted  by  the  inner  regions  is  absorbed  by  the  surrounding  layers.  As  the 
bubble  compresses,  the  temperature  is  highest  at  the  center  and,  when  a  certain  threshold 
is  exceeded,  H20  molecules  begin  to  dissociate  there.  A  fraction  of  them  is  excited  and 
radiates  upon  decaying  to  the  ground  level.  Very  soon  afterward,  the  temperature  in 
the  next  layer  of  gas  exceeds  the  O //-production  threshold,  and  these  radicals  are  now- 
available  to  absorb  the  radiation  still  coming  from  the  center  of  the  bubble.  For  the 
case  considered  previously  our  computations  indicate  that  it  takes  about  100  psec  for  the 
temperature  “wave”  to  travel  to  a  distance  of  0.01  radii  from  the  origin  center. 

To  explain  why  no  light  is  emitted  from  the  outer  layer,  one  may  note  that,  by  sym¬ 
metry,  the  temperature  gradient  must  vanish  at  the  bubble  center  (another  consequence 
of  the  presence  of  a  singularity  !).  Hence,  the  temperature  in  the  central  region  of  the 
bubble  is  much  more  homogeneous  than  in  the  surrounding  layers.  In  particular,  in  the 


Figure  4:  Successive  shapes  of  a  20  /an  gas  bubble  driven  at  20  kHz  by  a  1  bar-amplitude 
sound  field  switched  on  at  t  =  0.  The  time  unit  is  10.5  psec.  Initially  the  bubble  is  located 
1/4  wavelength  above  the  antinode  of  the  sound  field. 

outermost  layer  where  the  temperature  barely  exceeds  the  threshold,  virtually  all  the 
OH' s  are  produced  in  the  ground  state  and  therefore  can  absorb,  but  not  emit,  radiation. 
Clearly,  radiationless  transitions  to  the  ground  state  must  also  occur,  and  these  are  ren¬ 
dered  probable  by  the  high  number  density  of  the  molecules  (of  the  order  of  1018/cm3  for 
the  example  of  Figs.  2  and  3). 

With  numbers  of  this  order  of  magnitude  involved,  one  might  wonder  whether  one 
could  get  “cold  fusion”  to  occur  in  a  liquid  subjected  to  a  sufficiently  intense  sound  field 
and,  indeed,  the  idea  for  such  a  reactor  has  been  patented  [7],  Unfortunately,  there  is  a 
snag:  the  spherical  shape  of  an  oscillating  bubble  has  a  strong  tendency  to  instability, 
so  that  any  attempt  to  drive  the  bubble  too  strongly  will  lead  to  its  shattering  rather 
than  to  increased  temperatures  and  pressures.  This  instability  is  similar  to  the  well- 
known  Rayleigh-Taylor  one  as,  when  the  pressure  in  the  bubble  rises,  the  liquid  is  being 
accelerated  upon  by  a  lighter  fluid,  the  gas. 

We  present  the  results  of  a  calculation  that  illustrates  this  instability  in  Fig.  4.  The 
bubble  here  has  a  radius  of  20  pm  and  is  forced  by  a  1-bar,  20-kHz  plane  standing  acoustic 
wave.  In  spite  of  the  fact  that  the  wavelength  is  nearly  4,000  times  larger  than  the  bub¬ 
ble  radius,  the  non  zero  pressure  gradient  at  the  bubble  location  (1/4  wavelength  above 
the  antinode)  induces  a  perturbation  that,  as  can  be  seen,  is  strongly  amplified  during 
the  compression  phase  of  the  oscillations.  This  calculation  ignores  various  dissipative 
processes  and  therefore  it  probably  overestimates  somewhat  the  magnitude  of  the  effect. 
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Figure  5:  Bubble  entrapment  (last  frame)  by  a  2.86  mm-diameter  water  drop  impacting 
a  water  surface  at  2  m/sec  (first  frame).  The  free  surface  of  the  liquid  is  shaded.  Times 
are  in  msec. 

3.  SOUND  GENERATION 

A  completely  different  range  of  phenomena  in  which  bubbles  are  of  major  importance 
is  sound  production  in  liquids.  The  flushing  of  a  toilet  or  simply  the  filling  of  a  glass 
usually  make  a  far  louder  noise  than  water  flowing  in  the  pipes  of  a  house.  The  reason 
is  that  a  homogeneous  liquid  emits  sound  due  to  turbulence,  which  is  a  quadiupole- 
type  source,  while  bubbles  behave  as  monopole  sources.  To  illustrate  these  processes, 
I  will  briefly  discuss  two  bubble-related  mechanisms  responsible  for  the  production  of 
oceanic  underwater  noise.  Since  sound  propagates  over  great  distances  in  the  ocean, 
these  mechanisms  are  of  importance,  among  other  reasons,  as  diagnostic  tools  for  oceanic 
conditions  and  processes. 

An  interesting  example  is  rain  noise  that,  at  low  wind  speeds,  is  experimentally  found 
to  possess  a  spectrum  -  strongly  peaked  at  about  14  kHz  -  very  nearly  independent  of  rain 
drop  size  distribution  and  rain  intensity.  This  seeming  “universality”  of  the  rain  noise 
spectral  shape  has  puzzled  early  investigators  until  the  key  was  identified  in  the  fluid 
dynamics  of  bubble  production  by  drops  striking  a  free  surface.  This  work  is  discussed  at 
length  elsewhere  [8].  Briefly,  it  is  found  that  drops  striking  a  liquid  surface  make  very  little 
noise  unless  a  bubble  is  entrained  in  the  liquid.  The  results  of  a  calculation  illustrating 
this  process  a**e  presented  in  Fig.  5.  Bubble  entrapment  occurs  only  in  a  narrow  range 
of  drop  sizes  and  impact  velocities  which,  for  impacts  at  terminal  velocity,  corresponds 
to  diameters  between  about  0.8  and  1.1  mm.  Since  drops  in  this  size  range  are  always 
present,  and  since  other  drops  are  acoustically  irrelevant,  all  rains  sound  very  much  alike. 

Anyone  who  has  listened  to  a  wave  breaking  on  the  shore  is  quite  aware  that  a  sub¬ 
stantial  amount  of  noise  exists  in  the  ocean  also  at  frequencies  much  lower  than  the  14-20 
kHz  characteristic  of  rain  noise.  As  a  matter  of  fact,  measurements  indicate  a  substantial 
amount  of  wind-dependent  (and  therefore  not  man-made)  oceanic  noise  at  frequencies  of 
a  few  hundred  Hz  and  lower  [9],  The  linear  natural  frequency  of  oscillation  of  a  bubble 
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having  an  equilibrium  radius  a  is,  approximately  [lOj, 
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where  p0  is  the  equilibrium  internal  pressure  in  the  bubble  and  1  <  k  <  7,  with  7  the  ratio 
of  the  specific  heats,  is  a  polytropic  index  dependent  in  a  complex  way  on  the  frequency. 
According  to  this  formula,  for  a  bubble  to  pulsate  at  100  Hz,  the  diameter  should  be 
6-7  cm.  It  is  very  unlikely  that  such  large  bubbles  are  produced  in  substantial  numbers 
by  waves  that,  in  the  open  ocean,  break  in  a  spilling  rather  than  the  plunging  inode 
characteristic  of  near-shore  breaking.  Instead,  there  are  substantial  reasons  to  believe 
that  this  low-frequency  noise  is  originated  by  the  collective  oscillations  of  the  bubble 
clouds  produced  by  breaking  waves  [11,12],  A  simple  argument  can  be  presented  in  the 
following  terms.  At  low  frequencies,  the  speed  of  sound  cm  in  a  liquid  containing  gas 
bubbles  is  given  approximately  by 
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where  0  =  Vg/V  is  the  fraction  of  the  mixture  volume  V  occupied  by  the  gas.  The 
simplest  way  to  derive  this  expression  is  to  note  that,  if  the  liquid  compressibility  is 
neglected,  a  change  in  the  mixture  density  pm  can  only  be  due  to  a  change  in  the  volume 
Vo  occupied  by  the  gas  sc  that  6pm/pm  =  —6Vg/V.  On  the  other  hand,  if  the  bubbles 
compress  polytropicallv,  SVgIVg  —  —^Sp/po  and  (2)  readily  follows  upon  identifying  c7m 
with  Spjdpm ■  1 

The  striking  fact  is  that  already  at  very  small  volume  fractions  em  is  much  lowrer  than 
c.  For  instance,  for  0  =  1%  and  atmospheric  pressure,  one  finds  cm  ~  100  m/sec  to  be 
compared  with  c  =  1500  m/sec  in  pure  water.  Thus,  in  view  of  the  great  disparity  between 
c  and  cm  a  bubble  cloud  can,  to  a  first  approximation,  be  regarded  as  a  compressible 
medium  in  a  rigid  enclosure.  The  lowest  normal  mode  of  oscillation  of  the  cloud  has 
therefore  a  frequency  wmin  of  the  order  of  cm / L ,  where  L  is  the  linear  scale  of  the  cloud, 
so  that,  from  (1)  and  (2),  one  finds 
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where  N  is  the  number  of  bubbles  in  the  cloud.  This  result  shows  that  1,000  1-mm 
bubbles  -  which  individually  have  a  natural  frequency  of  about  3  kHz  -  would  give  rise  to 
a  cloud  oscillating  at  about  300  Hz.  Since  the  liquid  surrounding  the  cloud  is  not  actually 
incompressible,  the  cloud  eigenrnodes’  energy  leaks  out  and  is  radiated  to  great  distances. 

An  indirect  quantitative  confirmation  of  this  process  has  been  obtained  by  using  an 
“artificial”  cloud  in  the  laboratory  [13].  If  air  is  bled  out  of  a  disk-shaped  bank  of  needles 
at  the  bottom  of  a  tank,  a  hydrophone  picks  up  a  signal  similar  to  that  shown  in  Fig.  6. 
The  peaks  correspond  to  eigenfrequencies  of  the  system  and  can  be  calculated  as  a  check 
of  the  theory.  Some  results  of  this  comparison  are  shown  in  Fig.  7. 

’The  result  (2)  fails  to  converge  to  the  correct  limit  as  0  — ►  0  since  the  liquid  compressibility 
has  been  neglected.  It  can  be  shown  that  this  is  justified  at  gas  volume  fractions  above  about 
10~3%. 


364 


Frequency  i  Hz) 


Figure  6:  Frequency  spectrum  recorded  in  a  laboratory  tank  in  which  a  steady  cylindrical 
column  of  bubbles  is  maintained  by  air  injection  at  the  bottom.  Note  the  many  resonances 
of  the  system.  (From  Nicholas  et  al. ,  Ref.  13.) 

Figure  7:  Comparison  between  theory  and  experiment  for  the  frequencies  of  the  normal 
modes  shown  in  the  previous  figure.  The  horizontal  scale  is  the  gas  volume  fraction.  The 
differences  at  the  higher  frequencies  are  due  to  the  proximity  of  the  resonance  frequency 
of  the  individual  bubbles.  (From  Nicholas  et  al.,  Ref.  13.) 

The  agreement  deteriorates  at  the  lower  volume  fractions  due  to  the  proximity  of  the 
resonance  frequency  of  the  individual  bubbles,  approximately  1.9  kHz  in  this  case  [3  4]. 

It  is  remarkable  that  the  only  source  of  excitation  for  this  system  is  the  “pinch-off” 
noise  emitted  by  the  bubbles  as  they  leave  the  needles.  A  computed  example  of  this 
process  is  shown  in  Fig.  8,  where  the  compression  of  the  bubble  as  it  detaches  from 
the  needle  is  quite  clear.  In  a  breaking  wave,  the  processes  giving  rise  to  the  formation 
of  the  bubbles  are  far  more  violent  and  a  much  higher  noise  emission  level  can  be  expected. 

4.  SHOCK  WAVES  IN  BUBBLY  LIQUIDS 

The  last  example  of  the  previous  section  provides  an  instance  in  which  it  is  not  possible 
nor  desirable  to  account  in  detail  for  the  behavior  of  the  individual  bubbles,  but  it  is  more 
efficient  to  treat  the  liquid-gas  mixture  as  a  continuum  endowed  with  a  complex  internal 
structure.  The  area  of  gas-liquid  multi-phase  flows  is  one  of  great  industrial  relevance 
which  is  currently  the  object  of  intense  research  efforts. 

After  the  case  of  linear  pressure  waves  touched  upon  in  the  previous  section,  perhaps 
the  simplest  case  to  consider  is  that  of  shock  waves  propagating  in  a  dilute  mixture  of 
bubbles  and  liquid.  Such  waves  may  originate  e.g.  in  pipelines  during  operational  tran¬ 
sients.  Experiment  [15]  shows  that  the  steady  profile  of  weak  waves  is  either  completely 
smooth  (Fig.  9,  left)  or  smooth  at  the  wave  front  and  “rippled”  in  the  middle  (Fig.  9, 
right).  Our  recent  study  [16]  shows  that  this  fact  is  due  to  the  peculiar  nature  of  the 
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Figure  8:  Successive  shapes  of  a  bubble  blown  out  of  a  1  mm-diameter  needle. 


Figure  9:  Examples  of  pressure  profiles  in  steady  shock  waves  in  a  bubbly  liquid.  At 
left  is  a  completely  smooth  waveform,  at  right  an  oscillating  one.  The  dashed  line  is  the 
effective  poly  tropic  index  along  the  wave.  (From  Ref.  16). 
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dispersion  relation  for  pressure  waves  in  a  bubbly  liquid.  Consider  lirst  linear  waves  at 
very  low  frequency.  As  the  bubbles  are  compressed,  the  gas  temperature  has  time  to 
equilibrate  with  that  of  the  surrounding  liquid,  the  process  is  isothermal  and,  in  Eq.  (2). 
k  =  1.  As  the  frequency  increases,  complete  temperature  equilibration  is  not  possible  and 
k  becomes  greater  than  1  so  that  the  speed  of  sound  increases.  At  still  higher  frequencies 
the  natural  frequency  of  the  bubbles  is  approached,  it  is  easier  and  easier  to  compress 
them,  and  physical  processes  not  accounted  for  in  Eq.  (2)  greatly  decrease  the  speed  of 
sound.  The  phase  velocity  of  the  waves  possesses  therefore  a  mild  maximum  below  the 
resonance  frequency  of  the  bubbles.  This  feature  is  clear  in  Fig.  10  where  the  solid  lines 
show  the  dispersion  relations  corresponding  to  the  region  near  the  front  and  the  back  of 
the  waves  of  Figs.  9.  The  dashed  lines  are  the  group  velocity.  The  speed  U  of  the  shock 
itself,  shown  by  the  horizontal  dash-and-dot  lines,  is  given  by 
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where  the  subscripts  a  and  b  refer  to  conditions  ahead  and  behind  the  wave  respectively. 
The  two  panels  of  Fig.  10  exhibit  a  major  qualitative  difference.  In  the  case  of  the  some'  1. 
shock,  there  is  a  substantial  range  of  frequencies  where  monochromatic  waves  near  the 
front  and  the  back  propagate  faster  than  the  shock  itself:  any  perturbation  falling  in  these 
frequency  ranges  will  therefore  propagate  away  from  the  shock.  The  shock  is  only  faster 
than  very  low -frequency,  long-wavelength  perturbations  --  which  exceed  the  scale  of  the 
shock  and  therefore  are  not  generated  -  or  than  high-frequency,  highly  damped  ones.  In 
this  situation  the  integrity  of  the  shock  can  only  be  maintained  by  nonlinear  effects  and 
the  profile  is  smooth.  In  the  case  of  the  oscillating  shock,  however,  the  shock  speed  is 
faster  than  that  of  any  wave  near  its  front,  which  is  therefore  steep  and  smooth.  In  the 
intermediate  shock  region,  however,  waves  having  a  speed  comparable  with  that  of  the 
shock  are  possible,  and  it  is  these  waves  that  remain  “trapped”  in  the  shock  and  confer 
to  it  the  oscillatory  nature  seen  in  the  Fig.  10. 


5.  CONCLUSIONS 

Reasons  of  space  have  only  allowed  me  to  give  the  briefest  indication  of  some  cur¬ 
rent  problems  in  the  mechanics  of  bubbles  and  bubbly  liquids.  It  has  not  been  possible 
to  even  mention  many  important  topics  such  as  cavitation  -  both  flow  and  acoustic  - 
bubble  nucleation,  break-up,  and  coalescence,  air  entrainment  in  flowing  liquids,  biomed¬ 
ical  applications,  micro-gravity  manufacturing,  boiling  heat  transfer,  and  many  others. 
Gas-liquid  multiphase  flows  are  of  great  relevance  e.g.  in  the  power  generation  and  oil 
industries  and  very  poorly  understood.  One  can  expect  theoretical  as  well  as  experimental 
challanges  in  these  areas  for  many  years  to  come. 

In  conclusion  I  would  like  to  return  for  a  moment  to  the  single  pulsating  bubble  with 
an  idea  that  is  perhaps  far-fetched  but  nevertheless  intriguing.  It  is  possible  that  here  we 
shall  find  a  “power  plant”  for  the  micro-robots  of  the  future.  It  is  not  inconceivable  that 
such  machines  could  be  used  e.g.  inside  the  human  body  where  power  from  any  source 
other  than  acoustic  would  be  impractical. 
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Figure  10:  Dispersion  relations  for  the  two  shock  waves  of  F'ig.  9. The  horizontal  line  is  the 
wave  velocity,  the  solid  lines  the  phase  velocity  and  the  dashed  lines  the  group  velocity. 
The  upper  pair  of  curves  is  for  conditions  at  the  back  of  the  wave,  and  the  lower  pair  for 
conditions  near  the  front  of  the  wave.  (From  Ref.  16.) 
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APPENDIX  -  AN  ETYMOLOGICAL  NOTE 

According  to  Ref.  [17],  the  Greek  word  for  bubble,  nop4>6\v£,  is  related  through  an 
intermediate  variant  7r op4>6<;  to  7rep<pt<f  which  has  the  basic  meaning  of  “something  that 
is  created  by  inflating  or  blowing.”  Interestingly  one  of  the  etymologies  offered  for  the 
Latin  bulla  is  from  the  root  <f>a\—  (whence  e.g.  phallus ),  indicating  “any  object  swelling 
up”  [18].  However,  there  does  not  seem  to  be  any  consensus  on  this  point  as  other  authors 
indicate  a  derivation  from  the  Sanskrit  belih,  which  denotes  the  external  female  genitals 
[19].  Here  the  connection  might  be  through  the  idea  of  “rotundity.”  Another  Greek  word 
sometimes  juxtaposed  might  be  0o\0o<;,  “bulb”  [17]. 

Be  that  as  it  may,  it  seems  very  likely  that  onomatopoeic  elements  have  also  shaped 
the  word,  as  we  find  the  basic  sound  in  unrelated  languages  such  as  Sumerian  ( bubul ), 
Accadian  (bubutu),  Hindi  ( bud-bud ,  bull-bula),  Bengali  (bud-bud),  Cebuano-Visayan  ( buld ), 
and  others. 

As  for  the  English  bubble  and  its  ancestors  boble ,  bobel,  bubbul,  the  onomatopeic  ele¬ 
ment  is  probably  dominant  as  also  indicated  by  the  closeness  with  burble  [20].  Parallel 


substantives  ill  other  Teutonic  languages  are  e.g.  the  Swedish  bubbla,  the  Danish  boblt , 
arid  the  dialectal  German  words  bobbt  l ,  bubbcl.  The  c  urrent  German  word  for  “bubble  '  is 
blase,  similar  to  the  Dutch  blaas ,  “bladder,” 

All  the  modern  Romance  languages  have  a  word  deriving  from  the  Latin  bulla,  al¬ 
though  sometimes  the  meaning  of  “bubble”  has  been  lost.  An  example  is  Spanish  where 
the  word  bula  now  primarily  denotes  the  seal  apposed  to  Papal  documents,  but  was  in 
the  past  used  for  burbuja ,  that  is  the  modern  word  for  “bubble.” 


REFERENCES 

1.  S.  Schama,  The  Embarrassment  of  Riches,  Knopf,  New  York,  1987,  pp.  512-516. 

2.  W.  Stechov,  The  Art  Bulletin  20,  227-228  (1938). 

3.  R.E.  Venal  and  C.M.  Sehgal,  in  K.S.  Suslick  (ed.),  Ultrasound:  its  Chemical,  Phys¬ 
ical,  ancl  Biological  Effects,  VC11  Publishers,  New  York,  1988,  pp.  227-286. 

4.  V.  Kaniath,  A.  Prosperetti,  ancl  F.  Egolfopoulos,  J.  Acoust.  Soc.  Ain.,  in  press. 

5.  C.  Sehgal,  R.G.  Sutherland,  and  R.E.  Verrall,  J.  Phys.  Chem.  84,  388-395  (1980). 

6.  B.P.  Barber  ancl  S.J.  Putterman,  Nature  352,  318-319  (1991);  B.P.  Barber,  R.  Hiller, 
K.  Arisaka,  H.  F-dtcrman,  and  S.J.  Putterman,  J.  Acoust.  Soc.  Am.  91,  3061-3063 
(1992). 

7.  H.G.  Flynn,  Method  of  generating  energy  by  acoustically  induced  cavitation  fusion 
and  reactor,  U.S.  Patent  No.  4,333,796  (1982). 

8.  A.  Prosperetti,  L.A,  Crum,  and  H.C.  Pumphrey,  J.  Geophys.  Res.  94,  3255-3259 
(1989);  II. N.  Oguz  and  A.  Prosperetti,  J.  Fluid  Meek.  219,  113-179  (1990);  H.N. 
Oguz  and  A.  Prosperetti,  J.  Fluid  Mech.  228,  417-442  (1991);  H.N.  Oguz  and  A. 
Prosperetti,  Ann.  Rev.  Fluid  Mech.,  in  press. 

9.  see  e.g.  B.Il.  Kerman,  J.  Acoust.  Soc.  Am.  75,  149-165  (1984);  P.C.  Wille  and  D. 
Geyer,  J.  Acoust.  Soe.  Am.  75,  173-185  (1984);  D.J.  Kcwley,  D.G.  Browning,  and 
W.M.  Carey,  J.  Acoust.  Soc.  Am.  88,  1894-1902  (1990). 

10.  M.S.  Plesset  and  A.  Prosperetti,  Ann.  Rev.  Fluid  Mech.  9,  145-185  (1977). 

11.  W.M.  Carey,  J.  Acoust.  Soc.  Am.  78,  S1-S2  (1985);  W.M.  Carey  and  D.  Browning, 
in  B.R.  Kerman  (ed.),  Sea  Surface  Sound,  Kluwer,  Boston,  1988,  pp.  361-376;  W.M. 
Carey,  J.W.  Fitzgerald,  and  D.G.  Browning,  in  B.R.  Kerman  (ed.),  Natural  Sources 
of  Underwater  Sound,  Kluwer,  Boston,  in  press. 


12.  A.  Prosperetti,  J.  Acoust.  Soc.  Am.  78,  S2  (1985);  J.  Acoust.  Soc.  Am.  84, 
1042-1054  (1988);  A.  Prosperetti,  N.Q.  Lu,  and  II. S.  Kim,  J.  Acoust.  Soc.  Am.,  in 


36') 


13.  N.Q.  Lu,  A.  Prosperetti,  and  S.W.  Yoon,  IEEE  J.  Octan  Eng.  15,  275-281  (195)0); 
S.w.  Yoon,  L.A.  Crum,  A.  Prosperetti,  and  N.Q.  Lu,  J.  A  roust.  Soc.  Am.  89, 
700-706  (1991);  Nl.  Nicholas,  L.A.  Crum,  R.A.  Roy,  H.N.  Oguz,  and  A.  Prosperetti, 
J.  Acoust.  Soc.  Am.,  submitted. 

14.  K.W.  Commander  and  A.  Prosperetti,  J.  Acoust.  Soc.  Am.  85,  732-746  (1989). 

15.  L.  Noordzij  and  L.  van  Wijngaarden,  J.  Fluid  Mech.  66,  115-143  (1974). 

16.  M.  VVatanabe  and  A.  Prosperetti,  J.  Fluid  Mech.,  submitted. 

17.  P.  Chantraine,  Dictionnaire  Etymoloyique  de  la  Langue  Grecquc,  Klincksieck,  Paris, 
1968. 

18.  C.T.  Lewis  and  C.  Short,  A  Latin  Dictionary ,  Clarendon,  Oxford,  1975  (first  edition 
1879). 

19.  A.  Ernout  and  A.  Meillet,  Dictionnaire  Etymologique  de  la  Langue  Latine ,  Klinck¬ 
sieck,  Paris,  1959. 

20.  J.A.  Simpson  and  E.S.C.  Weiner  eds.,  Oxford  English  Dictionary ,  Clarendon,  Ox¬ 
ford,  1989. 


Theoretical  and  Applied  Mechanics  1992 

S.R.  Bodner,  J.  Singer,  A.  Solan  &  Z.  Hashin  (Editors) 

Elsevier  Science  Publishers  B.V.  .»? ! 

1993  IUTAM. 


Wave  propagation  in  non-isotropic  structures 

Mahir  Behar  Sayir 


Institute  of  Mechanics,  ETH,  Zurich,  Switzerland 


Abstract 

In  structural  wave  propagation  the  characteristic  wavelengths  of  the  propagating  pulse 
are  chosen  to  be  sufficiently  large  with  respect  to  the  thickness  of  the  structure.  Narrow- 
band  transient  structural  waves  are  particularly  well-suited  for  example  to  study  and 
quantitatively  evaluate  global  ("homogenized")  dynamical  properties  of  sandwich  or  fiber- 
reinforced  composite  structures  such  as  laminates,  tubes  etc.  The  waves  are  generated  in 
various  membrane  or  flexural  modes  by  piezo-electric  transducers  excited  at  narrow  fre¬ 
quency  bands  well  below  their  resonant  frequencies  to  avoid  coupling  and  thus  allow  per¬ 
fect  control  of  the  pulse  shape.  The  theoretical  models  used  to  interpret  and  evaluate  the 
experimental  results  obtained  in  sandwich  and  composite  structures  must  be  carefully  de¬ 
veloped  in  order  to  predict  accurately  the  features  following  from  the  interaction  of  the 
various  geometric,  dynamic  and  material  scales.  A  few  experimental  results  obtained  in 
recent  years  in  our  laboratory  are  discussed  in  conjunction  with  the  corresponding  theoret¬ 
ical  models.  Part  of  these  results  are  also  used  to  interpret  dynamical  phenomena  ob¬ 
served  during  the  propagation  of  fast-running  cracks  in  cross-ply  laminates. 


1.  TERMINOLOGY  AND  MOTIVATION 


Consider  a  transient  wave  in  the  amplitude  range  of  linear  elasticity  in  a  thin  structure 
such  as  a  beam,  plate  or  shell  .  The  time  function  is  supposed  to  be  of  narrow-band  type 
and  can  be  generated  by  applying  a  suitable  window  (for  example  a  translated  cosine- 
window  of  relatively  large  duration)  on  a  sine-  or  cosine-function  of  a  given  frequency.  In 
the  amplitude-frequency  spectrum  of  this  function  the  non-vanishing  amplitudes  will  lie  for 
ail  practical  purposes  between  two  frequency  limits.  The  higher  limit  finax  will  correspond 
in  the  structure  to  a  wavelength  X  which  we  call  the  characteristic  wavelength  of  the 
travelling  pulse.  In  a  structural  wave  the  ratio  of  the  largest  thickness  H  of  the  structure  to 
the  characteristic  wavelength  of  the  propagating  pulse  (called  "dynamic  parameter ")  is 
supposed  to  be  sufficiently  small,  i.e. 


ex  H/L  «1 
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The  stress-  and  strain-distributions  generated  by  such  a  wave  across  the  thickness  of  the 
structure  are  stationary  and  usually  quite  simple  mathematical  functions  of  the  thickness 
coordinate.  The  wave  travels  then  in  membrane,  torsional  or  flexural  modes,  with  possible 
coupling  between  them.  Furthermore,  in  fiber-reinforced  composite  structures,  due  to  the 
much  tuwer  scale  of  the  fiber  diameter  or  of  the  ply  thickness,  most  of  the  important  fea¬ 
ture*  of  structural  wave  propagation  can  be  interpreted  by  assuming  homogeneous  mate- 
oal  behaviour. 

Per  definition  in  anisotropic  materials  with  orthotropic  symmetry  three  mutually  per¬ 
pendicular  principal  axes  of  material  symmetry  can  be  associated  with  each  material  point. 
With  respect  to  structural  waves  structures  may  behave  orthotropically  either  because 
the  material  itself  is  orthotropic  or  because  a  combination  of  geometric  features  and  mate¬ 
rial  distribution  through  the  corss-section  of  the  structure  generate  "globally”  non-isotro¬ 
pic  structural  behaviour  in  spite  of  the  fact  that  the  material  itself  is  isotropic.  An  interest¬ 
ing  case  of  such  "structural  non- isotropy”  has  been  studied  in  f  1 J,  where  a  flexural  wave 
in  an  aluminum  beam  with  nearly  perfect  circular  cross-section  (measured  deviations  from 
a  circular  boundary  less  then  0.05  %)  "rotated"  most  probably  because  of  an  extremely 
slight  inhomogeneity  in  the  distribution  of  the  axial  modulus  of  elasticity  E  through  the 
cross-section  (largest  difference  in  the  weighted  average  of  the  modulus  in  two  perpendi¬ 
cular  directions  of  the  order  of  1  %).  The  material  itself  was  shown  to  behave  istoropically 
within  the  accuracy  of  the  measurements. 

In  isotropic  materials  the  ratio  of  the  shear  modulus  G  to  the  tensile  modulus  F 

em  :=  G  /  E 

which  will  be  called  "material  parameter"  in  the  following  has  to  be  larger  than  0.33. for 
reasons  of  material  stability.  Of  course  in  anisotropic  structures  such  as  fiber-reinforced 
laminates  or  filament-wound  tubes  this  material  parameter  may  reach  much  lower  values 
(for  example  0.01).  Also  in  sandwich  structures  with  a  soft  isotropic  core  and  stiff 
isotropic  facings  this  parameter  (determined  from  equivalent  stiffness  values)  is  usually 
quite  low.  In  our  laboratory  sandwich  beams  and  plates  with  £m  as  low  as  0.0004  have 
been  tested  [2].  Thus  from  the  point  of  view  of  effects  associated  with  low  values  of  the 
material  parameter  Em  (for  example  transverse  shear  coupling)  sandwich  structures  may 
behave  non-isotropically  even  though  the  material  of  the  components  is  isotropic. 

Our  experimental  research  activity  on  wave  propagation  in  non-isotropic  structures 
which  started  about  15  years  ago  was  first  motivated  by  the  need  to  reach  a  better  under¬ 
standing  of  coupling  phenomena  which  are  characteristic  of  non-isotropic  behaviour.  Thus 
to  study  mode-coupling  in  non-principal  directions  we  investigated  flexural  waves  in  unidi- 
rectionally  reinforced  plates  [3]  or  in-plane  waves  in  cross-ply  laminates  [4,  17].  To  study 
coupling  phenomena  related  to  low  values  of  Em  we  generated  flexural  waves  in  unidirec- 
tionally  reinforced  beams  [5],  in  sandwich-plates  with  extremely  soft  cores  {2]  or  axisym- 
metric  waves  in  filament-wound  tubes  [6,  4],  These  experimental  investigations  were 
interpreted  by  developing  for  each  case  appropriate  theoretical  models  based  on  a  careful 
asymptotic  analysis  of  the  basic  relations  of  three-dimensional  linear  elasticity.  Approxi¬ 
mations  based  on  a  priori  assumptions  were  avoided  since  in  all  cases  the  presence  of  at 
least  two  competing  small  parameters,  namely  e\,  Em  and  of  some  other  small  geometric 
ratios  (as  in  sandwich  plates  or  filament-wound  tubes)  rendered  the  choice  of  physically 
justified  simplifying  assumptions  without  a  straightforward  ordering  scheme  rather  diffi¬ 
cult.  With  a  more  thorough  knowledge  of  the  above-mentioned  coupling  phenomena  we 
were  also  able  to  determine  structural  properties  such  as  flexural  stiffnesses  and  various 


moduli  of  elasticity  with  quite  high  degrees  of  accuracy  }7j.  Our  experimental  and  theoreti¬ 
cal  methods  of  analysis  can  also  he  used  to  detect  Haws  or  structural  errors  which  affect 
local  stiffness  values  to  some  extent,  even  though  the  characteristic  wavelengths  of  the 
generated  pulses  may  be  much  larger  than  the  size  of  the  flaws.  Especially  flexural  waves 
which  by  nature  are  particularly  sensitive  to  changes  in  thickness  are  well  suited  to  detect 
flaws  such  as  delaminations  or  to  check  for  possible  errors  in  ply  combinations,  filament- 
wounding  etc.  Details  related  to  current  research  on  applications  to  non  destructive 
detection  will  be  reported  elsew-here.  In  the  following  1  will  present  a  review  of  the  main 
features  related  to  the  experimental  and  theoretical  work  briefly  mentioned  above, 
together  with  some  unpublished  recent  results. 


2.  SHEAR  COUPLING  IN  FLEXURAL  WAVES 


2.1  Unidirectionally  reinforced  beams 

In  (8]  a  theory  for  flexural  w-aves  with  large  wavelengths  in  unidirectionally  reinforced 
beams  was  elaborated.  According  to  this  theory  which  was  derived  by  asymptotic 
analysis  of  the  three-dimensional  equations  of  linear  elasticity,  the  order  of  magnitude  of 
the  parameter 

h  rv 

p:=7TVo  <2-11 

determines  decisively  the  propagation  characteristics  of  transverse  structural  waves  in 
beams.  The  following  table  summarizes  the  role  of  the  parameter  p: 


Order  of  magnitude 

Degree  of  anisotropy 

Nature  of  the  shear  coupling 

p  «  1 

Isotropy, 

Weak  or  moderate  anisotropy 

Weak :  Becomes  active  at  the 
second  step  of  approximation 

p  —  0  ( 1 ) 

"Quasi-strong"  anisotropy 
(transitional  case) 

Strong:  Becomes  active  at  the 
first  step  of  approximation 

p  »  1 

Strong  anisotropy 

Sandwich- behaviour 

In  the  transitional  case  of  "quasi-strong"  anisotropy  with  p  =  O  (1)  the  usual  assump¬ 
tion  that  cross-sections  remain  plane  after  deformation  does  not  hold  since  the  transverse 
shear  strains  become  so  large  that  they  reach  the  magnitude  of  the  axial  strains  due  to 
bending.  Because  of  this  strong  shear  coupling,  the  variations  of  the  axial  displacements 
along  the  axis  of  the  beam  are  connected  with  their  variations  through  the  cross-section. 
The  axial  strains  and  stresses  are  then  in  general  noniinearly  distributed  through  the 
cross-section.  For  sinusoidal  waves  of  a  given  amplitude  of  transverse  displacement  the 
amplitudes  of  distributions  of  axial  strain  and  stress  are  illustrated  in  Fig,  1  for  a  rectan- 
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gular  cross-section.  In  [8)  distributions  for  a  circular  cross-section  were  also  given.  As  p 
takes  larger  values,  the  nonlinearity  of  the  strain  and  stress  distribution  becomes  also 
more  pronounced.  As  one  reaches  values  of  p  much  larger  than  1,  axial  strains  and 
stresses  concentrate  on  the  parts  of  the  cross-section  which  are  near  the  lateral  surfaces 
of  the  beam.  A  core  of  finite  thickness  frees  itself  almost  completely  of  axial  stresses  and 
carries  only  a  constantly  distributed  shear  stress  (fig.  2).  Thus  the  unidirectionally  rein¬ 
forced  beam  behaves  like  a  "natural"  ideal  sandwich  structure. 

In  [8]  the  dispersion  relation 

c:=a-nP)"2vf  (2.2) 


(c:  phase  velocity  of  flexural  waves,  p:  specific  mass  per  unit  volume)  was  derived  for  the 
transitional  case  of  "quasi-strong"  anisotropy  in  a  beam  with  rectangular  cross-section. 
Since  this  case  corresponds  to  a  "distinguished  limit”  in  the  asymptotic  sense,  it  can  also 
be  used  in  the  case  of  moderate  anistropy  by  expanding  for  small  p.  One  obtains 
eventually 
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Fig.  1:  Distribution  of  normal  and  shear  Fig.  2:  Sandwich  behaviour  for  strong 

stress  through  the  cross-section  for  p  =  O  (1)  anisotropy  with  p  »  1 


(k  =  2  7t  /  X  =  "wave  number").  This  relation  has  also  been  derived  directly  in  (7)  for  a 
plane  bending  wave  travelling  in  one  of  the  principal  directions  of  structural  symmetry  in 
an  orthotropic  plate.  It  includes  with  the  factor  fs  the  correction  for  shear  deformation  but 
excludes  the  effect  of  rotary  inertia  which  for  large  ratios  E/G  (for  example  larger  than  10 


as  in  most  applications  to  fiber-reinforced  structures)  becomes  indeed  negligible.  In  |7| 
we  showed  how  this  relation  could  be  conveniently  used  to  measure  "global"  values  of  H 
and  G  in  the  principal  directions  of  an  orthotropic  plate  in  bending  mode  (see  also  Section 
4  of  the  present  paper). 

Experimental  work  to  check  the  dispersion  relations  for  phase  velocities  as  a  function 
of  frequency  or  wave  number  calculated  in  [8]  was  performed  by  Kolsky  and  Mosquera  (9| 
who  used  beams  with  a  rubber  matrix  reinforced  by  steel  wires.  They  covered  the  range  of 
strong  anisotropy,  changed  the  value  of  the  parameter  p  by  varying  the  frequency  and  ob¬ 
tained  excellent  agreement  with  the  theoretical  predictions.  Experimental  work  was  also 
performed  by  Goodbread  and  myself  in  our  laboratory  using  beams  with  an  Araldite  matrix 
reinforced  by  Kevlar  49  fibers  at  60  volume  %.  Results  were  briefly  presented  at  ICTAM, 
Toronto  in  1980  and  published  much  later  in  (5}.  To  vary  the  parameter  p  we  investigated 
not  only  various  frequencies  but  tried  also  to  change  the  material  parameter  em  by  heating 
the  beam  in  a  constant  temperature  oven.  Comparing  the  phase  spectra  of  the  measured 
transient  pulses  at  two  different  locations  we  obtained  the  flexural  phase  velocities  both 
as  a  function  of  temperature  and  frequency  .and  compared  them  with  theoretical  predictions 
using  values  of  tensile  and  shear  moduli  determined  in  independent  resonance  experi¬ 
ments.  The  results  given  in  some  detail  in  [5]  showed  excellent  agreement  between  the¬ 
ory  and  experiment. 

2.2  Sandwich  plates 

To  study  strong  shear  coupling  further  M.  Koller  built  in  our  laboratory  a  sandwich 
plate  exhibiting  ideal  sandwich  structural  behaviour  in  flexure.  In  this  case  only  the  facings 
should  contribute  to  the  bending  moment  and  only  the  core  should  carry  the  shear  force. 
Such  behaviour  is  assumed  a  priori  in  most  of  the  theoretical  work  on  sandwich  structures 
(see  for  example  [10,  1 1 J).  Our  study  [2]  was  aimed  at  determining  the  limits  of  applica¬ 
bility  of  the  idealized  theoretical  models.  Aluminum  or  glass  facings  of  thickness  h  -  0.5 
mm  were  combined  with  a  polyurethan  or  polyethylen  core  of  thickness  2H  =  3  mm  to  ob¬ 
tain  a  sandwich  plate  with  the  desired  characteristics.  The  ratio  of  the  shear  modulus  of 
the  core  Gc  to  the  tensile  modulus  Ef  of  the  facings  was  in  the  range  0.26  -  1.3  10  ■.  Koller 
used  the  plates  with  glass  facings  to  study  impact  by  steel  balls  of  various  sizes  (radius 
varying  from  1  to  10  mm)  in  the  elastic  range  [12].  The  peculiarities  of  sandwich  behaviour 
as  compared  to  an  isotropic  structure  can  also  be  demonstrated  under  static  loading.  Fig. 
3  illustrates  two  examples  of  the  differences  in  static  response  of  sandwich  beams  as 
compared  to  the  isotropic  counterparts.  The  original  photographs  of  the  actual  experiments 
are  published  in  \2\.  Koller  showed  in  his  impact  experiments  that  whereas  in  a  thin  iso¬ 
tropic  plate  the  cross-sections  in  the  impact  zone  are  translated  rapidly  and  remain  sta¬ 
tionary  until  waves  are  reflected  from  the  edges  of  the  plate,  in  a  sandwich  plate,  due  to 
the  shear  compliance  of  the  core  which  acts  like  a  spring,  the  cross-sections  bounce  back 
after  reaching  a  maximum  displacement  value  and  remain  stationary  at  a  lower  amplitude 
f  12]. 

The  theoretical  part  of  our  study  in  [3|  was  developed  by  asymptotic  analysis  of  the 
three  dimensional  basic  relai  as  of  linear  elasticity  for  isotropic  behaviour  of  both  facings 
and  core.  Four  competing  small  dimensionless  parameters 
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Fig.  3:  Differences  in  the  static  behaviour  of  a  sandwich  beam  and  of  its  isotropic 
counterpart 


and  their  relation  to  each  other  had  to  be  considered  (pc,  pf  :  specific  mass  of  the  core  or 
facings).  Provided  that  Em  is  sufficiently  small  with  respect  to  (Eh)3,  three  wavelengths  of 
decreasing  orders  of  magnitude  in  the  range  of  structural  wavelengths  (large  compared  to 
the  total  thickness  2H  +  2h  of  the  plate)  can  be  defined.  The  role  of  these  wavelengths 
with  respect  to  the  characteristic  wavelength  of  the  propagating  pulse  is  characterized  in 
the  table  below. 


Characteristic  wavelength 
X  comparable  with _ 


Main  dynamic  contribution  Main  dynamic  contribution 
of  the  facings  besides  inertia  of  the  core  besides  inertia 


Xi  :=  2  k  H 


In-planeforces  through 
in-plane  deformation 


Transverse  shear  force 


X2  :=  2  k  H 


X3  :=  2  7t  H 


./HZ 

V  Gc  H2 


Only  second  order  values 


Bending  moments  through 
flexural  deformation 


Transverse  shear  force 


Transverse  shear  force 


For  wavelengths  comparable  with  Xi  the  dispersion  behaviour  (phase  velocity  as  a 
function  of  frequency  or  reciprocal  wavelength)  is  similar  to  the  corresponding  behaviour  of 
a  fully  isotropic  homogeneous  plate  but  is  strongly  influenced  by  shear  deformation  even 
at  relatively  low  frequencies.  For  wavelengths  comparable  with  X2  the  phase  velocity 
takes  values  in  the  vicinity  of  the  shear  wave  velocity  of  the  core  corrected  by  the  inertia 
of  the  facings.  If  the  wavelength  is  comparable  with  X3  (which  for  Em  sufficiently  large 


with  respect  to  (eh)3  is  still  larger  than  the  total  thickness  of  the  plate)  the  flexural  be¬ 
haviour  of  the  sandwich  plate  is  dictated  by  the  flexural  properties  of  the  facings  which  be¬ 
have  like  thin  plates  on  an  elastic  foundation  with  appreciable  shear  stiffness  and  negligi¬ 
ble  compressive  or  tensile  stiffness.  As  reported  in  J3]  the  theoretically  predicted  disper¬ 
sion  curve  in  the  latter  range  of  wavelengths  comparable  with  X.3  has  been  checked  exper¬ 
imentally  for  sandwich  plates  designed  to  fulfill  the  requirements  on  the  four  small  param¬ 
eters  mentioned  above.  Radially  propagating  flexural  waves  were  generated  with  the  help 
of  a  piezoelectric  transducer  glued  on  one  of  the  facing  surfaces. sufficiently  far  from  the 
edges  to  avoid  interferences  with  reflections  during  the  recording  of  the  transient  pulse. 
The  phase  velocity  was  obtained  as  a  function  of  frequency  by  comparing  the  phase  spec¬ 
tra  of  the  displacements  at  two  or  more  radial  locations  on  the  plate.  The  transient  signals 
were  measured  with  laser-interferometry,  fed  into  a  transient  recorder  and  Fourier-ana¬ 
lyzed  subsequently.  The  fit  between  theoretical  and  experimental  results  was  remarkable, 
since  all  parameters  used  in  the  theoretical  predictions  (Ef,  Gc,  etc)  were  measured  in  in¬ 
dependent  experiments.  The  results  are  illustrated  in  Fig.  4.  A  less  spectacular  but  still 
satisfactory  agreement  between  theory  and  experiment  was  obtained  in  the  elastic  impact 
study  carried  out  by  Roller  [  1 2 J . 


Fig.  4:  Theoretical  and  experimental  dispersion  curves  for  sandwich  plates 


Current  research  work  still  in  progress  concentrates  on  certain  aspects  of  damage 
caused  on  plates  with  both  isotropic  and  fiber-reinforced  facings.  In  another  current 
research  project  possibilities  of  non-destructive  detection  of  damage  in  sandwich  plates 
are  being  investigated. 
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3.  FLEXURAL  WAVES  IN  UNIDIRECTION  ALLY  REINFORCED  PLATES 


In  the  sandwich  plates  mentioned  in  the  previous  section,  since  both  the  facings  and 
the  core  were  isotropic,  the  sinusoidal  "components"  of  the  flexural  waves  propagated  in 
the  radial  direction  generating  phase  cylinders  of  circular  shape  (phase  cylinder:  cylindrical 
surface  of  constant  phase,  its  axis  is  perpendicular  to  the  plate  and  its  intersection  with 
the  plate  surface  is  a  phase  curve).  Thus  the  phase  velocities  were  independent  of  the  ra¬ 
dial  direction  along  which  they  were  measured.  In  a  unidirectionally  fiber-reinforced  com¬ 
posite  plate  loaded  in  the  same  manner  with  the  help  of  a  transducer  mounted  on  one  of  its 
two  outer  surfaces  the  phase  of  a  sinusoidal  component  is  also  expected  to  be  constant 
along  cylindrical  surfaces  but  of  course  these  have  no  circular  symmetry  (Fig.  5). 


Fig.  5:  Flexural  waves  generated  by  a  transducer  in  a  unidirectionally  fiber-reinforced 
plate 


The  main  purpose  of  the  study  described  in  [3]  and  [13]  was  to  investigate  the  shape 
of  the  phase  curves  and  to  determine  the  dependence  of  the  radial  phase  velocity  on  the 
angle  9  with  respect  to  the  fiber  direction  xi.  A  unidirectionally  reinforced  laminate  made 
out  of  5  plies  of  0.14  mm  thickness  of  carbon-reinforced  epoxy  was  subjected  to  transverse 
loading  with  the  help  of  a  piezoelectric  transducer.  The  transient  signal  was  recorded  at 
various  radial  directions  and  distances  from  the  transducer.  From  the  difference  of  the 
phase  spectra  of  the  measured  signal  at  two  radial  locations  with  the  same  angle  with  re¬ 
spect  to  the  fiber  direction  the  wavelength  and  hence  the  radial  phase  velocity  was  de¬ 
termined  as  a  function  of  frequency  and  angle  of  propagation.  My  former  doctoral  student 
A.  Rreis  who  performed  the  experiments  concentrated  on  the  frequency  value  f  =  29.3  kHz 
and  measured  for  example  in  the  fiber  direction  the  wavelength  =  20.3  mm  and  in  the  di¬ 
rection  perpendicular  to  the  fibers  >.2  =  10.7  mm.  Since  X\  was  about  30  times  and  ki 
about  15  times  larger  than  the  total  thickness  H  =  0.71  mm  of  the  plate,  shear  deformation 
effects  were  judged  to  be  too  small  to  affect  the  behaviour  at  the  mentioned  frequency  and 
a  simple  first  order  theory  was  used  to  derive  the  governing  differential  equation  for  the 


transverse  displacement  w  (t,  xj,  X2).  The  following  equation  was  derived  by  asymptotic 
analysis  of  the  three  dimensional  basic  equations  (derivatives  are  indicated  as  usual  with 
a  comma,  the  symbols  for  the  various  moduli  of  elasticitv  are  self-explanatory): 


Ei  w.nii  +  2  (V12  E2  +  2  G12)  w,  1122  +  E2  w,2222  + 


12  P 
H2 


=  0  • 


For  sinusoidal  time  functions  of  circular  frequency  cu  and  with  the  coordinate  transforma¬ 
tions 


x;=Xl[iLPJ^]l/4 
Ei  H2 


y  :=  x2  [ 


12  p  to2 
E2  H2 


1/4 


the  differential  equation  above  was  simplified  to  obtain 

W.xxxx  +  W,yyyy  +  25  W,Xxyy  =  W  (3.1) 

for  the  amplitude  function  w  (x,y)  where 
5  E2  Vi2  +  2  G12 

Vei  e2 

is  supposed  to  be  small  and  was  evaluated  a  posteriori  on  the  basis  of  the  experimental 
results  as  0.29. 

Kreis  tried  to  solve  (3.1)  for  the  case  of  concentrated  load  by  Fourier-transform  and 
contour-integration  to  perform  the  inverse  transform.  Some  of  the  integrals  resulting  from 
this  approach  had  to  be  evaluated  numerically.  The  solution  of  (3.1)  contains  not  only  the 
cylindrical  wave  (without  circular  symmetry)  generated  by  the  transducer  but  also  the  sta¬ 
tionary  motion  whose  amplitude  decays  strongly  with  increasing  distance  from  the  trans¬ 
ducer  and  whose  role  is  to  supply  to  the  travelling  wave  the  appropriate  rhythm  of  ex¬ 
change  of  potential  and  kinetic  energy,  thus  allowing  propagation  of  the  energy  introduced 
by  the  concentrated  load.  Since  all  measurements  were  performed  at  a  sufficient  distance 
from  the  transducer  where  for  all  practical  purposes  the  stationary  motion  had  fully  de¬ 
cayed,  only  the  part  of  the  solution  which  describes  the  propagating  wave  is  really  of  inter¬ 
est  in  the  evaluation  of  the  experimental  data.  To  obtain  this  part  A.  Kreis  used  the  me¬ 
thod  of  stationary  phase  on  the  expressions  for  the  inverse  Fourier-transform  {3).  In  a  dif¬ 
ferent  approach  I  tried  recently  to  obtain  a  physically  meaningful  approximate  solution  for 
the  differential  equation  (3.1)  by  direct  asymptotic  analysis,  assuming  that  the  coordi¬ 
nates  x,  y  correspond  to  "large"  radial  distances  from  the  transducer  (distances  more  than 
three  times  larger  than  the  diameter  of  the  transducer).  The  following  procedure  proved  to 
be  quite  efficient: 

Set 


w  (x,y)=  exp  fu  (x,y)j 

in  the  differential  equation  (3.1)  and  obtain  a  differential  equation  of  the  form 
Di  fu)  +  D2  fu)  +  D3  fuj  +  D4  [u]  =  1 


(3.2) 
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where  [.]  are  nonlinear  differential  operators  involving  derivatives  of  order  k  =  1,  2,  3 
or  4.  It  can  be  shown  that  for  large  distances  from  the  origin  (0,0),  in  a  first  step  of  appro¬ 
ximation,  the  terms  with  the  higher  order  differential  operators  can  be  neglected  and  the 
differential  equation  is  reduced  to 

Di  [u]  :=  (u,x)4  +  (u,y)4  +  26  (u,x)2  (u,y)2  =  1  . 

For  5  =  0  its  exact  solution  corresponding  to  our  problem  is 


uo  (x,y)  =  -  i  fo  (x,y)  =  -  i  [x4^  +  y4^]374  . 


For  small  8  >  0  the  approximate  solution 

u  (x,y)=-if  (x,y)  =  -  i  [(1  +~82)  (x4^  +  y 4/3)-|8  y2;3]3/4  +  0(63)  (3.3) 

can  easily  be  derived  by  regular  asymptotic  expansion  in  terms  of  8,  Thus  the  phase 
curves  of  the  propagating  wave  are  given  by  f  (x,y)  =  constant.  The  next  steps  of  approx¬ 
imation  in  the  asymptotic  approach  to  (3.2)  involve  increasing  orders  of  the  operators 
Dk  Eventually  one  obtains  a  solution  of  the  form 

w  (x,y)  =  -&^Lexp  [-if  (x,y)]  .  (3  4\ 

■v  x2  +  y2  y  ’ 

The  factor  (x2  +  y2)-i/2  (s  obtained  by  considering  the  operator  D2  [u]  in  (3.2).  The 
function  g  (y/x)  which  determines  the  dependence  of  the  amplitude  from  the  angle  of 
propagation  for  large  distances  from  the  transducer  follows  by  considering  the  operator 
D3  [u]  in  (3.2). 

The  asymptotic  expansion  (3.3)  in  terms  of  8  becomes  singular  near  the  axes.  The 
word  "near"  means  in  this  case  either  y/x  =  O  (83/2)  ("near"  the  x-axis)  or  x/y  =  O  (83/2) 
("near"  the  y-axis).  Performing  the  usual  singular  perturbation  analysis  with  the  appropri¬ 
ate  boundary  layer  variable  one  obtains  for  example  near  the  x-axis  a  solution  of  the  type 
(3.4)  with 


f(x,y)=x{l+^(£)2  +  0[^-(£)4]}  .  (3.5) 

Evaluating  (3.5)  to  obtain  the  radial  phase  velocity  at  small  angles  with  respect  to  the 
fiber  direction  in  a  unidirectionally  reinforced  plate,  one  can  show  that  its  value  is  quite 
sensitive  to  small  changes  in  this  angle.  In  the  particular  plate  which  was  tested  in  our 
laboratory,  the  radial  phase  velocity  at  an  angle  of  7°  was  10  %  lower  with  respect  to  its 
value  along  the  fiber  direction.  Thus  to  obtain  accurate  values  of  the  tensile  modulus  near 
the  fiber  direction  by  measuring  the  corresponding  phase  velocity,  particular  care  must  be 
taken  to  remain  in  the  proper  radial  direction  while  choosing  the  locations  where  the  dis¬ 
placement  is  measured. 


Fig.  6:  Theoretical  phase  curve  fitted  to  experimental  results  by  choosing  Ej,  Ej 


Fig.  6  shows  phase  velocities  obtained  from  wavelengths  measured  in  various  radial 
directions.  The  theoretical  phase  curve  corresponding  to  the  developments  described 
above  has  been  fitted  to  the  experimental  results  by  proper  choice  of  two  paiameters,  i.e. 
the  two  tensile  moduli  E]  and  E2  corresponding  to  the  fiber  direction  and  the  direction  per¬ 
pendicular  to  it  respectively.  The  dimensionless  parameter  5  which  includes  both 
Poisson's  ratio  V12  and  the  shear  modulus  G[2  was  calculated  using  values  of  these  two 
material  constants  determined  independently.  Since  2x13  experimental  values  have  been 
fitted  with  only  2  parameters,  the  obtained  results  confirm  quite  successfully  (within  5.5 
m/s  variance  with  respect  to  measured  phase  velocities)  both  the  experimental  investiga¬ 
tion  and  the  theoretical  development. 


4.  EVALUATION  OF  FLEXURAL  MODULI  IN  ORTHOTROPIC  LAMINATES 


The  experimental  methods  developed  to  study  the  propagation  flexural  waves  with 
(non  circular)  cylindrical  phase  surfaces  in  unidirectionally  reinforced  laminates  have  been 
subsequently  refined  to  allow  the  systematic  experimental  evaluation  of  global  flexural 
moduli  in  orthotropic  laminates  with  arbitrary  symmetric  lay-up.  In  spite  of  the  apparent 
success  illustrated  in  Fig.  6  where,  by  appropriately  choosing  values  of  E(  and  E2,  an  ex¬ 
cellent  fit  with  measured  phase  velocity  values  along  different  radial  directions  was  ob¬ 
tained,  the  method  described  in  Section  3  has  mainly  three  weaknesses: 
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a)  Although  the  wavelength  of  Xi  =  20.3  mm  measured  in  the  fiber  direction  xj  was 
about  30  times  larger  than  the  total  plate  thickness  H  =  0.71,  the  assumption  that  shear 
deformation  effects  can  be  neglected  is  not  quite  justified.  This  is  immediately  shown  by 
evaluating  the  factor  fs  in  (2.3)  with  the  values  kjH  =  2ttH/Xi  =  0.220  and  Es/G 1 3  =  34.2. 
One  obtains  fs  =  0.926.  This  is  quite  different  from  the  ideal  value  of  1  for  which  shear  de¬ 
formation  would  be  vanishingly  small.  In  fact  a  more  accurate  analysis  including  shear  de¬ 
formation  effects  according  to  (2.3)  reveals  that  the  experimental  data  leads  to  a  value  of 
Ei  along  the  fibers  of  about  200  109  N/m2  which  is  17  %  higher  than  the  value  obtained  in 
Section  3. 

b)  Curve  fitting  with  the  phase  curve  delivers  for  all  practical  purposes  only  E]  and  £2- 
The  other  moduli  such  as  G12,  Gi3,  G23  and  V12  needed  for  an  accurate  fitting  have  to  be 
determined  from  additional  independent  experiments. 

c)  To  obtain  the  phase  curve  experimentally  a  series  of  2n,  preferably  3n  measure¬ 
ments  must  be  performed  where  n  is  the  number  of  different  radial  directions  along  which 
the  phase  velocity  has  to  be  determined.  To  measure  along  different  radial  directions  re¬ 
quires  a  certain  preparation  which  costs  time. 


Fig.  7:  Experimental  set-up  for  a  systematic  evaluation  of  laminate  stiffneses 


More  satisfactory  results  were  obtained  by  measuring  phase  velocities  only  along  the 
two  principal  axes  of  orthotropy  (n  =  2).  To  ensure  that  in  spite  of  the  various  experimen¬ 
tal  inaccuracies  to  be  usually  expected  in  such  a  reduced  programme,  the  accuracy  of  the 
global  moduli  resulting  from  the  analysis  remained  sufficiently  high,  the  phase  velocity 
was  measured  as  a  function  of  frequency  by  comparing  the  phase  spectra  at  two  or  more 
points  along  each  of  the  two  principal  directions.  Thus  for  each  pair  of  measured  signals  a 
large  number  of  phase  velocity  values  for  different  frequencies  were  produced  in  the  sub¬ 
sequent  computer  analysis  and  compared  with  theoretical  predictions  based  on  linear- 
elastic  homogeneous  behaviour  including  shear  deformation  effects.  Theoretical  results 
and  experimental  values  were  fitted  (least  squares)  by  appropriate  choice  of  the  global 
flexural  tensile  moduli  Ei,  E2  and  global  transverse  shear  moduli  G13,  G23  along  the  prin¬ 
cipal  directions.  However,  for  orthotropic  laminates  with  arbitrary  symmetric  lay-up,  one 
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has  to  bear  in  mind  that  the  global  moduli  obtained  from  flexural  modes  are  generally  ex¬ 
pected  to  be  different  from  the  ones  which  can  be  determined  in  membrane  modes  (see 
Section  6). 

The  set-up  in  our  Laboratory  corresponding  to  the  systematic  experimental  evaluation 
mentioned  above  and  main  results  for  particular  laminates  which  we  produced  at  the 
Laboratory  of  Composite  Design  in  our  University  thanks  to  the  courtesy  of  Prof.  M. 
Flemming  have  been  described  in  some  detail  in  (7|.  Fig.  7  shows  the  main  features  of  our 
experimental  set-up.  Narrow  band  flexural  pulses  with  well-defined  functions  are  intro¬ 
duced  through  a  computer  and  transferred  with  the  help  of  a  function  generator  to  a  piezo¬ 
electric  transducer  properly  glued  at  some  central  location  of  the  surface  of  a  laminate 
whose  stiffnesses  are  to  be  determined.  The  flexural  displacement  of  the  plate  is  mea¬ 
sured  at  two  or  more  locations  in  the  radial  direction  at  some  distance  from  the  transducer 
by  properly  positioning  (the  position  is  computer-controlled)  a  heterodyne  laser  interfer¬ 
ometer  measuring  displacements  in  the  direction  of  the  laser  beam  according  to  the 
Doppler-effect.  The  details  of  the  interferometer  which  was  originally  built  and  further  de¬ 
veloped  at  our  Laboratory  by  J.  Goodbread  have  been  described  in  [14].  The  demodulated 
interferometer  signal  is  fed  into  a  digital  storage  transient  recorder  and  subsequently 
transferred  to  the  computer  to  be  Fourier-analyzed  and  further  processed.  The  main  rules 
to  be  observed  in  such  experiments  have  been  listed  in  some  detail  in  [4|.  Here  three 
essential  features  will  be  briefly  mentioned: 


Fig.  8:  Obtaining  E;  and  G n  in  a  cross-ply  laminate  from  data  on  phase  velocities  in 
flexure 


(1)  The  frequency  band  of  the  signal  should  be  chosen  to  lie  well  below  the  lowest 
resonance  frequency  of  the  transducer  to  avoid  elastic  coupling  between  the  wave  genera¬ 
tor  and  the  responding  structure. 
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(2)  The  time  duration  of  the  signal  and  the  frequency  band  of  the  signal  should  be  co¬ 
ordinated  with  the  location  of  the  transducer  to  avoid  reflections  from  the  edges  of  the 
plate  which  could  interfere  with  the  measurements. 

(3)  Since  the  time  function  of  the  signal  is  well-defined,  the  same  wave  is  generated  a 
few  hundred  times  at  time  intervals  allowing  waves  generated  by  one  signal  to  decay  be¬ 
fore  the  next  signal  is  sent.  By  taking  a  simple  additive  average  of  the  hundreds  of  re¬ 
sponses  measured  at  the  same  location  (and  generated  with  the  same  time  function  at  the 
source)  the  noise  to  signal  ratio  is  decreased  considerably,  since  thanks  to  averaging 
stochastic  noise  due  to  various  mechanic  or  electronic  sources  is  either  cancelled  or  re¬ 
duced  below  levels  of  relevance. 

M.  Veidt  measured  at  our  Laboratory  pbr >e  velocities  as  a  function  of  frequency  in 
carbon-fiber  reinforced  cross-ply  laminates  which  were  subsequently  needed  to  study  dy¬ 
namic  fracture  phenomena  (see  Section  7).  Since  cylindrical  flexural  waves  travel  at  some 
distance  from  the  transducer  along  the  principal  directions  of  an  orthotropic  plate  essen¬ 
tially  like  corresponding  plane  waves  (see  Section  3)  and  since  in  cross-ply  laminates  V12 
(corresponding  to  in-plane  lateral  contraction  resilience)  is  very  low  (about  0.04)  the  re¬ 
sult  (2.2)  for  phase  velocities  can  be  directly  used  to  analyze  experimental  data  on  phase 
velocities  along  the  principal  directions  of  orthotropy  (in  [7]  the  expanded  form  (2.3)  was 
used).  Fig  8  shows  an  example  of  fitting  theoretical  predictions  to  experimental  data  for 
various  frequencies  to  obtain  both  the  global  tensile  modulus  Ei  and  the  transverse  shear 
modulus  Gn  in  flexural  mode  of  a  cross-ply  laminate  with  a  symmetric  lay-up  (0,  90,  0,  90, 
0]S- 


Fig.  9:  Filament-wound  tube  axisymmetrically  loaded  at  one  end  by  piezoelectric 
transducers 


5.  AXISYMMETRIC  WAVES  IN  FILAMENT-WOUND  TUBES 


Axisymmetric  waves  in  isotropic  tubes  have  been  studied  in  great  detail  both  experi¬ 
mentally  and  theoretically  by  a  number  of  scientists  (see  for  example  S 151).  In  contrast, 
data  on  composite  tubes  are  rather  scarce.  Thus,  as  part  of  his  PhD  work  [6|  J.  Dual  per¬ 
formed  a  series  of  systematic  experiments  on  carbon-reinforced  tubes  produced  by  fila- 


merit  winding  on  a  total  length  of  2  m  with  12  CXK)  filaments  per  roving  and  a  total  number 
of  15  layers  (fig.  9).  He  considered  5  cases  with  winding  angles  0°,  22.5°,  45°,  67.5°  and 
82°.  The  main  purpose  of  the  experiments  was  to  measure  the  dispersion  behaviour  and  to 
obtain  the  relevant  moduli  of  the  tube.  The  study  should  also  serve  as  a  basis  for  further 
investigations  regarding  non-destructive  detection  of  possible  flaws  and  structural  errors. 
Dual  used  in  his  di  sertation  a  theoretical  framework  based  on  numerical  evaluations  of 
power-series  expansions  of  the  full  three-dimensional  equations  for  homogeneous, 
orthotropic  behaviour.  Particularly  efficient  and  accurate  analytical  expressions  based  on 
an  asymptotic  analysis  of  the  three-dimensional  equations  for  the  range  of  structural 
wavelengths  covered  by  the  experiments  have  since  been  developed  and  will  be  published 
in  [16].  The  asymptotic  analysis  has  also  the  merit  of  providing  an  extremely  useful  phys¬ 
ical  insight  into  the  various  patterns  of  deformation  in  the  various  ranges  of  wavelengths 
and  into  the  corresponding  propagation  characteristics.  A  short  report  on  main  results  of 
this  analysis  and  of  its  confrontation  with  experimental  data  produced  by  Dual  has  been 
published  in  [4],  In  the  following  a  summary  of  the  salient  features  will  be  presented. 

The  asymptotic  analysis  mentioned  above  is  based  on  a  careful  exploitation  of  the 
three  small  parameters 

eE:=a  .  «*:-£  ,  eM:^ 

which  are  inherent  to  the  problem  of  axisymmetric  structural  waves  in  a  thin  orthotropic 
tube.  In  the  tubes  tested  the  value  of  the  geometric  parameter  er  was  about  0.1.  For  the 
tubes  with  0°,  22.5°  and  45°  it  was  found  a  posteriori  that  Em  ~  0.04,  0.06  and  0.2  respec¬ 
tively.  For  the  remaining  two  angles  of  winding  the  axial  modulus  E2  in  Em  has  to  be  re¬ 
placed  by  the  hoop  modulus  E3.  Thus  in  all  tubes  Em  =  O  (er)  could  be  assumed.  This  is 
the  case  which  has  been  called  in  [8]  "moderately  strong  anisotropy".  Depending  on  the 
relation  of  E\  to  Er  the  following  three  different  ranges  of  wavelengths  can  be  defined: 

1)  Wavelengths  comparable  with  the  perimeter  of  the  tube,  £\  =  O  (Er) 

The  waves  propagate  in  a  "membrane  mode"  with  both  radial  (uj)  and  axial  (U2)  com¬ 
ponents  of  the  displacement  uniformly  distributed  across  the  thickness  of  the  tube  wall. 
The  following  explicit  analytical  expression  for  the  phase  velocity  c  as  a  function  of  fre¬ 
quency  f  follows  from  the  simplified  differential  equations  of  the  asymptotic  analysis  (v  = 
V23): 


C2  = - - 

1  -  Q2  (1  -  e  v2) 


_ C o,  --  -2  aiL-f 

fWTp  ’  '  iW7p 


For  values  of  D  «  1  corresponding  to  very  large  wavelengths  (exceeding  consider¬ 
ably  the  perimeter  of  the  tube)  the  waves  propagate  nondispersively  (c  =  VE-j/p,  indepen¬ 
dently  of  the  frequency  as  long  as  fl  «  1)  under  uniaxial  stress  conditions.  The  main 
displacement  component  is  U2,  radial  displacements  uj  can  be  neglected.  For  increasing 
values  of  Q,  as  long  as  Q  <1,  the  influence  of  the  radial  displacement  component  U|  be¬ 
comes  stronger  and  causes  more  and  more  energy  to  be  canalized  to  radial  motion  rather 
than  axial  wave  propagation.  Thus  the  value  of  phase  velocity  drops.  For  values  or  Q  near 
1  the  radial  motion  becomes  so  strong  that  energy  transport  in  the  axial  direction  becomes 
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almost  impossible  in  the  membrane  mode  ("radial  resonance").  For  increasing  values 
Q  >  1,  the  radial  motion  decreases  and  again  axial  displacements  become  more  important. 
Since  in  this  case  the  wavelengths  are  smaller  than  the  perimeter  (but  still  large  with  re¬ 
spect  to  the  wall  thickness  H),  the  curvature  of  the  tube  ceases  to  be  effective  and  the 
waves  tend  to  propagate  in  a  longitudinal  nondispersive  "plate  mode"  with  negligible  hoop 

strains  and  c  =  VE2/p(l-ev2).  Of  course  this  discussion  follows  very  closely  arguments 
well-known  from  isotropic  theory  (see  for  example  [9]). 

2)  Wavelengths  comparable  with  the  geometric  average  of  radius  and  thickness, 
£R  =  O  [(£A.)2J 

Anisotropic  behaviour  shows  in  this  case  a  feature  which  is  essentially  different  from 
the  isotropic  case:  Transverse  shear  deformation  coupled  with  flexural  stiffness  in  the  ax¬ 
ial  direction  becomes  so  important  that  simplifying  a  priori  assumptions  on  the  distribution 
of  strains  and  stress  across  the  thickness  (for  example  linear  or  cubic)  can  no  longer  be 
justified  even  in  a  first  step  of  approximation.  The  proper  distribution  must  be  determined 
from  partial  differential  equations  containing  derivatives  both  with  respect  to  X2  and  to  the 
thickness  coordinate  xi  by  considering  the  boundary  conditions  on  the  stress-free  surfaces 
of  the  tube  wall.  An  exact  analytic  solution  of  the  equations  and  of  the  boundary  problem 
corresponding  to  the  first  asymptotic  step  of  approximation  can  eventually  be  found,  fol¬ 
lowing  ideas  similar  to  the  ones  developed  in  [8]  for  beams  Details  of  the  analysis  will  be 
given  in  [16].  The  phase  velocity  as  a  function  of  frequency  or  wavelength  follows  from  the 
implicit  dispersion  relations 


Q2  =  1  +  [1  -  —  ~~2h  i2^)7  ,  a  :=  -  = 

aH^2  ^  X  VeM  (1  -  e  v2) 


c  =  X  f  = 


X 

2  K  R 


Q. 


(5.2) 


3)  Transition  regime  near  frequencies  Q  ~  1 

Both  solutions  given  above  for  moderately  large  and  moderately  short  wavelengths 
and  the  corresponding  dispersion  relations  (5.1)  and  (5.2)  fail  for  values  of  Q  approaching 
1  from  the  left  side  (membrane  solution  fails)  or  from  the  right  side  (flexural  solution 
fails).  This  is  a  typical  problem  of  singular  perturbation  schemes.  It  can  be  solved  for 
sinusoidal  waves  by  careful  analysis  leading  to  a  simple  differential  equation  of  6.  order. 
The  corresponding  dispersion  relation  in  terms  of  frequency  and  wavelength  can  be 
explicitly  expressed  as 


q2’1',?v2^r)2 


(2  re  R  H)2 


12  e  (1  -ev2) 


(5.3) 


Fig.  10  shows  for  the  case  of  winding  angles  of  ±  22.5°  experimentally  determined 
phase  velocities  and  the  corresponding  theoretical  values  based  on  (5.1),  (5.2)  and  (5.3). 
To  obtain  the  values  of  the  moduli  E2,  E3  and  v  needed  in  (5.1),  the  theoretical  phase  ve¬ 
locities  were  fitted  (least  squares)  to  135  pairs  of  experimental  values  for  various  fre¬ 
quencies  in  the  range  of  long  and  moderately  long  wavelengths  corresponding  to  the  mem- 


brane  behaviour  both  for  £2  <  1  and  Q  >  1.  A  relative  variance  as  low  as  0.8  could  be 
reached.  With  such  a  high  degree  of  accuracy  one  can  safely  assume  that  the  fitting  not 
only  delivers  the  proper  values  of  the  moduli  (in  the  case  at  hand  Ei  =  75.2  H)9  N/m2.  E<  = 
1 1.55  109  N/m2  and  v  =  V23  =  1.275)  but  also  a  meaningful  confirmation  of  the  quality  of 
the  experiments  and  of  the  adequacy  of  the  asymptotic  theory  leading  to  the  simple  rela¬ 
tion  (5.1).  Once  the  moduli  E2,  E3  and  v  were  determined,  the  theoretical  predictions 
based  on  (5.2)  and  (5.3)  were  fitted  to  the  experimental  phase  velocities  in  the  range  of 
moderately  short  wavelengths  (involving  flexure  of  the  tube  wall  and  transverse  shear)  by 
proper  choice  of  the  transverse  shear  modulus  G12.  Using  a  set  of  153  pairs  of  experimen¬ 
tal  values  for  the  tube  with  ±22.5°  winding  angle  the  value  G12  =  4.3  1()9  N/m2  could  be 
determined  with  0.9  %  relative  variance.  If  one  excludes  the  first  3  of  the  153  experimental 
pairs  of  values  where  measurements  are  quite  difficult  since  they  occur  in  the  transitional 
range  corresponding  to  the  vicinity  of  the  critical  value  £2  =1.  the  variance  drops  even  to 
less  than  0.2  %.  Again  this  remarkable  fit  with  just  one  parameter  may  be  considered  as  a 
brilliant  confirmation  of  both  the  experimental  work  of  Dual  and  of  the  efficiency  of  the 
asymptotic  approach  used  in  the  theoretical  modelling  leading  to  (5.2). 
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Fig.  10:  Obtaining  E2,  E3,  V23  and  G12  in  a  filament-wound  tube  from  data  on  phase 
velocities 


6.  IN-PLANE  WAVES  IN  A  CROSS-PLY  LAMINATE 


Cross-ply  laminates  which  were  studied  in  flexural  wave  propagation  modes  according 
to  the  program  described  in  Section  4  were  also  subjected  to  transient  loading  by  torsional 
piezo-electric  transducers  generating  in-plane  waves  (Fig.  11).  A  short  account  of  the  ex¬ 
perimental  and  theoretical  results  has  been  given  in  |4|.  A  more  detailed  treatment  will  be 
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presented  in  [17].  In  the  following  the  main  ideas  leading  to  the  theoretical  prediction  of 
the  phase  curves  will  be  briefly  reviewed.  These  theoretical  phase  curves  have  been  com¬ 
pared  with  experimental  data  produced  by  A.Eisenhut  at  our  Laboratory  on  a  particular 
laminate  produced  out  of  9  carbon-fiber-reinforced  laminae  arranged  symmetrically  as  (0, 
90,  0,  0,  90,  0,  0,  90,  0)-plies  with  respect  to  the  xj-axis. 


p=  1.52  103kg/m3 
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Fig.  11:  In-plane  waves  in  a  cross-ply  laminate  generated  by  a  torsional  transducer 


With  xa  as  coordinates  along  the  principal  directions  of  orthotropy  a  =  1,  2  the  basic 
equations  of  linear  elastodynamics  lead  in  the  case  of  in-plane  waves  in  an  orthotropic 
plate  to  the  differential  equations 

U],n  +  52  U2,12  +  E2  Uj,22  =  (Cl)-2  Uj,tt  , 

(6.1) 

E2  U2>n  +  52  Uj ,21  +  e2  u2>22  =  (Cl)'2  U2,tt 


w  nere 


e*  := 


-_E2 


G12  [1  -  (e  V12)2] 
El  ” 


52  :=  £2  +  e2  V12  ,  (cj)2  := 


p  [1  -  (e  V12T 


In  a  cross-ply  laminate  Ei,  E2  and  V12  are  global  in-plane  moduli  of  elasticity  correspond 
ing  to  the  moduli  of  the  equivalent  homogeneous  orthotropic  plate.  Whereas  e2  can  be  ex¬ 
pected  to  take  a  value  of  about  0.5  in  the  laminate  mentioned  above  with  twice  as  many 


0°-plies  than  90°-plies,  £ 2  should  be  an  order  of  magnitude  smaller  (for  example  0.05) 
since  the  cross-ply  laminate  is  quite  soft  in  shear.  Also  the  global  coefficient  of  lateral  con¬ 
traction  V12  should  be  small  (about  0.04),  so  that  the  value  of  82  is  not  expected  to  exceed 
0.07.  To  derive  the  shape  of  the  phase  curve  at  distances  from  the  transducer  where  for  ail 
practical  purposes  a  sinusoidal  mode  of  propagation  can  be  assumed  (distances  larger 
than  1  wavelength)  set 

ua  =  A«  exp  [i  co  t  -  i  f  (x,  y)|  ,  (a  =1,2)  ,  (x,  y) .-  ^  (xi,  x2)  (6.2) 

in  (6.1)  to  obtain  a  set  of  two  linear  homogeneous  equations  for  Ai  and  A2  which  only 
admit  non-trivial  solutions  if  the  determinant  of  the  coefficients  of  Aj  and  A2  vanishes. 
This  leads  to  a  differential  equation  for  f  (x,  y)  of  the  form 

di  [f]  +  id2ffl  +  d3ff]=0  (6.3) 

where  d2  (.1  is  a  nonlinear  differential  operator  involving  products  of  first  and  second 
derivatives  of  f  (x,  y)  and  d3  { .  J  is  a  nonlinear  differential  operator  containing  only  second 
derivatives  of  f  (x,  y).  The  explicit  expression  for  the  first  differential  operator  is 

di  ff]  :=  ((f,x)2  +  £2  (fy)2  -1]  [e2  (f,x)2  +e2  (f,y)2  -11  -  84  (f„  f,y)2  .  (6.4) 

Since  at  large  distances  (larger  than  at  least  one  wavelength)  from  the  transducer  cen¬ 
tered  at  the  origin  (0,  0)  the  terms  d2  [fl  and  d3  If}  with  the  derivatives  of  second  order 
can  be  neglected  (see  117}  for  details,  the  arguments  are  similar  to  those  used  in  connec¬ 
tion  with  (3.2)),  the  expression  for  the  phase  curve  can  be  obtained  by  setting  di  jfj  =  0 
according  to  (6.3).  Using  further  that  84  in  the  expression  (6.4)  is  a  small  quantity  (less 
than  0.005)  one  obtains  the  following  two  possible  expressions  for  f  (x,y): 

fO)  =  [x2(] +§l)  +  yijt/2  +  0(88)  ,  f<2>  =  pi  +  (1  +&i)]1/2  +  0{58)  .  (6.5) 

e2  e2  e2  e2  e2 

Whereas  in  isotropic  structures  two  solutions  corresponding  to  rotational  (with  dis¬ 
placements  in  the  hoop  direction)  and  dilatational  (with  displacements  in  the  radial  direc¬ 
tion)  motion  and  circular  phase  curves  are  expected,  in  the  cross-ply  laminate  at  hand  the 
tw'o  possible  phase  curve  shapes  are  obviously  elliptic  with  large  excentricities.  One  can 
further  show  that  in  a  first  step  of  approximation  is  associated  only  with  the  displace¬ 
ment  component  ui  along  the  first  principal  direction  of  orthotropy  and  f(2)  with  the  dis¬ 
placement  component  U2  along  the  second  principal  direction  of  orthotropy.  The  low  shear 
stiffness  (G j 2  «  Ej,  E2)  is  responsible  for  this  decoupling  phenomenon  with  respect  to 
the  principal  directions.  Thus  along  the  xj-axis  f’l  corresponds  to  pure  dilatational  motion 
and  along  the  x2-axis  to  pure  shear  motion.  Conversely  F2>  corresponds  to  pure  shear  mo¬ 
tion  along  the  xj-axis  and  to  pure  dilatational  motion  along  the  x2-axis.  Furthermore,  the 
cylindrical  waves  with  elliptic  phase  curves  propagate  obviously  without  dispersion,  since 
the  radial  phase  velocities  do  not  depend  on  the  frequency,  only  on  the  angle  with  respect 
to  the  first  principal  direction  X(. 

In  experimental  work  leading  to  his  diploma  thesis  A.  Eisenhut  studied  at  our 
Laboratory  in-plane  waves  produced  by  a  torsional  transducer  and  measured  the  arrival 
times  of  both  waves  carrying  displacements  ui  and  u2  respectively  at  various  radial  loca- 
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tions  as  a  function  of  the  angle  cp  with  respect  to  xj.  Thus  he  obtained  the  radial  velocities 
cri  (<p)  and  cr2  (cp)  for  both  waves.  For  each  value  of  tp  two  pairs  of  measured  radial  ve¬ 
locities  were  available.  1  fitted  (least  squares)  the  experimental  results  to  the  theoretical 
predictions  by  properly  choosing  the  values  of  the  moduli  of  elasticity  E],  E2  and  Gj2  (the 
influence  of  V12  with  expected  values  less  than  0.05  is  negligible)  and  obtained  the  results 
depicted  in  Fig.  12  which  exhibit  some  problems  in  the  dilatational  modes  with  large  ve¬ 
locities  near  the  principal  axes  of  orthotropy  where  the  difference  of  the  arrival  times  be¬ 
tween  two  locations  were  too  short  (7-10  ps)  to  allow  accurate  measurements.  Neverthe¬ 
less,  in  the  shear  modes  with  low  velocities  near  the  principal  axes  and  at  larger  angles 
with  respect  to  those  axes  the  accuracy  of  the  fit  is  quite  satisfactory  (less  than  3  % 
relative  variance)  and  confirms  both  the  quality  of  the  experimental  study  and  the  validity 
of  the  corresponding  theoretical  development. 


Fig.  12:  Phase  curves  of  cylindrical  in-plane  waves  in  a  cross-ply  laminate 
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It  is  interesting  to  compare  results  for  the  moduli  obtained  from  in-plane  waves  as 
discussed  in  the  present  section  with  those  generated  by  M.  Veidt  on  the  same  laminate 
in  the  context  of  his  doctoral  work  by  using  flexural  modes  as  described  in  Section  4  (see 
[7]  for  more  details). 
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The  appreciable  difference  in  the  values  of  E2  for  the  two  modes  is  most  probably  due 
to  the  fact  that  in  the  flexural  mode  the  central  90°-ply  with  its  fibers  parallel  to  the  X2-di- 
rection  contributes  almost  nothing  to  the  global  value  of  E2  whereas  in  the  membrane  (in¬ 
plane)  mode  it  is  as  effective  as  the  other  two  90°-plies  of  the  laminate  with  its  (0,  90,  0, 
0,  90,  0,  0,  90,  0)  lay-up.  Thus  E2  (membrane)  >  E2  (flexural).  Besides,  since  there  are 
twice  as  many  0°-plies  as  90°-plies,  the  value  of  Ei  in  the  membrane  mode  is  roughly 
twice  as  large  as  E2. 


7.  APPLICATIONS  TO  FAST  FRACTURE  IN  CROSS-PLY  LAMINATES 


In  the  context  of  experimental  work  leading  to  his  doctoral  thesis  118]  E.  Moor 
showed  in  our  Laboratory  that  in  thin  carbon-fiber  reinforced  laminates  submitted  to  ten¬ 
sile  loading  and  conveniently  notched  as  in  classical  tests  of  fracture  mechanics,  a  crack 
perpendicular  to  the  direction  of  the  load  propagates  from  the  tip  of  the  notch  towards  the 
other  edge  of  the  plate  with  an  almost  constant  velocity  reaching  values  in  the  range  of 
500  -1200  m/s  for  the  various  lay-ups  tested.  Moor  used  shadow-optical  methods  [19,  20] 
to  provide  experimental  data  on  the  stress  field  first  near  the  tip  of  the  crack  both  before 
and  after  it  started  running.  Preliminary  calculations  by  M.  Veidt  [21]  showed  that  the  re¬ 
sults  could  be  interpreted  by  assuming  that  the  laminate  behaved  as  a  homogeneous 
orthotropic  structure,  provided  that  the  global  shear  modulus  near  the  crack  tip  was  ad¬ 
justed  to  account  for  the  fact  that  damage  near  the  crack  tip  substantially  decreases  lo¬ 
cally  the  global  shear  stiffness  characteristics  of  the  structure. 

To  calculate  the  stress  and  strain  field  near  the  tip  of  a  crack  running  with  constant 
velocity  v  along  one  of  the  principal  directions  of  orthotropy  xj  of  a  thin  orthotropic  plate 
we  may  use  relations  (6.1)  and  look  for  a  stationary  solution  by  assuming  that  the  stress 
and  strain  fields  remain  stationary  with  respect  to  a  frame  moving  with  the  crack  tip  (this 
is  of  course  acceptable  only  as  long  as  reflections  from  the  edges  of  the  structure  can  be 
neglected).  Thus  we  set 

x  :=xj  -vt 

and  look  for  a  solution  of  (6.1)  of  the  form  ua  (x,  X2).  The  transformed  relations  (6.1)  are 


(1  -  Y2)  Ui,xx  +  52  U2,x2  +  E2  ui,22  =  0  , 

(e2  -  Y2)  U2,xx  +  S2  ui,2x  +  e2  U2,22  =  0  - 


(7.1) 
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where  y  :=  v  /  c\.  For  the  carbon-fiber  cross-piy  laminates  tested  in  our  laboratory  the 
value  of  y  was  in  the  range  0.10  -  0. 15,  so  that  y  <  £. 

We  now  look  for  a  particular  solution  of  (7.1)  which  is  derivable  from  a  potential  ac¬ 
cording  to 


Ui=<E>,x  ,  u2  =  q  <J>,2 


(7.2) 


where  q  is  a  constant  factor  which  can  be  adjusted  to  fulfill  both  equations  (7.1)  with  the 
single  function  <D  (x,  x2).  From  (7.1)  and  (7.2)  after  integrating  the  first  equation  with  re¬ 
spect  to  x  and  the  second  with  respect  to  x2  one  obtains 


(1  -  y2)  <D,XX  +  (q  52  +  e2)  Ol22  =  0  , 

(q(e2-y2)  +  52}<D,xx+qe2  O,22  =  0  , 


(7.3) 


O  (x,  x2)  is  a  solution  of  both  equations  if  these  are  identical,  i.e.  if  the  coefficients  of  0,xx 
and  0,22  in  the  first  and  second  equation  are  proportional.  This  leads  to  the  following  two 
values  for  the  parameter  q: 


T2  = 


p±Vp2  -4(£2  -y2)  84  e2 
2  (e2  -y2)  52 


P  :=  (1  -  y2)  e2  -  84  -  (e2  -y2)  £2 


(7.4) 


Thus  the  general  solution  for  (7.1)  can  be  written  as  a  sum  of  two  particular  solutions 

ui  =  ^lix  +  ^2’X  ,  u2  =  qi  Oi,2  +  q2  02,2  -  (7.5) 

where  according  to  (7.3)  the  functions  Oj  (x,  x2)  and  02  (x,  x2)  satisfy  the  uncoupled  dif¬ 
ferential  equations 

(1  -  Y2)  Oi,xx  +  (q1  62  +  e2)  0],22  =  0  , 

(7.6) 

(1  -  Y2)  02,xx  +  (q2  52  +  £2)  02l22  =  0  , 

which  can  easily  be  expressed  as  Laplace  differential  equations  by  transforming  x2  in  each 
equation  and  solved  by  classical  methods  of  potential  theory  . 

The  shadow-optical  data  gathered  by  E.  Moor  near  the  tip  of  cracks  in  cross-ply  lami¬ 
nates  fall  mainly  into  the  two  patterns  (a)  and  (b)  illustrated  in  Fig.  13.  The  pattern  (a 
occurs  at  the  tip  of  the  notch  before  the  crack  starts  running,  actually  at  loads  well  be  lot 
(less  than  20  %)  the  fracture  load.  Pattern  (b)  has  been  observed  at  the  tip  of  a  running 
crack.  These  patterns  can  be  interpreted  and  simulated  on  the  basis  of  the  theoretical 
background  presented  above  which  allows  the  prediction  of  the  displacement,  strain  and 
stress  fields  near  the  tip  of  a  crack  running  at  constant  velocity,  provided  that  refieeions 
from  the  edges  of  the  structure  can  be  neglected.  M.  Veidt  has  shown  that  the  main  differ¬ 
ences  in  the  shapes  (a)  for  a  cross-ply  laminate  and  (c)  for  an  isotropic  structure  is  due  to 
the  much  lower  ratio  £m  (shear  modulus  /  tensile  modulus)  in  the  former  case.  The  shape 
(b)  can  be  theoretically  simulated  by  assuming  that  the  shear  modulus  is  much  lower  (by 


a  factor  of  about  3)  than  the  one  measured  in  the  undamaged  cross-ply  laminate.  These 
findings  could  also  be  substantiated  by  E.  Moor  with  the  help  of  direct  strain  measure¬ 
ments  with  strain  gauges  conveniently  placed  in  the  vicinity  of  the  running  crack.  The  de¬ 
tails  of  these  experimental  studies  and  of  the  theoretical  developments  required  for  their 
interpretation  and  evaluation  will  be  published  elsewhere. 


(a)  Low  load,  stationary  (b)  Fracture  load,  running  (c)  IsotTOpiC  structure 

crack,  cross-ply  structure  crack,  cross-ply  structure  r 


Fig.  13:  Measured  patterns  of  shadows  at  the  crack  tips  of  cross-ply  and  isotropic 
structures 
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energy  dissipation  in  high-Reynolds-number  turbulence 
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Abstract 

We  begin  with  a  brief  description  of  the  multiplier  distribution  for  er, 
the  average  over  a  linear  interval  r  of  the  energy  dissipation  rate,  e. 
Using  measured  multiplier  distributions  obtained  for  atmospheric 
surface  layer  data  on  e,  we  show  that  quasi-deterministic  multiplicative 
models  for  bases  2  and  3  (that  is,  binary  and  tertiary  breakdown 
processes)  can  be  developed  on  a  rational  basis.  For  r  in  the  inertial 
range,  moments  computed  up  to  a  fairly  high  order  from  these  models 
are  found  to  be  in  good  agreement  with  experimental  values.  For  bases 
larger  than  three,  such  quasi-deterministic  approximations  for 
multiplier  distributions  are  not  possible.  Some  applications  of  multiplier 
distributions  are  presented. 

1.  INTRODUCTION 

A  Gaussian  process  is  completely  described  in  a  statistical  sense  by  its 
mean  and  standard  deviation.  It  is  conceivable  that  a  nearly  Gaussian 
process  can  be  described  well  by  its  first  few  moments  -  at  least  well 
enough  for  many  purposes.  This  is  the  situation  with  respect  to  velocity 
or  temperature  traces  obtained  in  high-Reynolds-number  fully 
turbulent  flows  not  too  close  to  the  wall.  On  the  other  hand,  the 
situation  is  quite  different  for  quantities  such  as  the  energy  dissipation 
rate,  e,  in  high-Reynolds-number  turbulence.  Figure  1  is  (effectively)  a 
one-dimensional  section  through  the  field  of  e  in  the  atmospheric 
surface  layer  a  few  meters  over  land.  In  contrast  to  Gaussian  or  nearly 
Gaussian  processes,  information  about  the  first  few  low-order  moments 
does  not  describe  the  signal  in  any  detail.  Peaks  which  are  hundreds  of 
times  the  mean  are  not  uncommon,  and  the  signal  is  at  other  times  of 
very  low  amplitude;  this  strongly  intermittent  character  is  a  generic 
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Figure  1:  A  typical  signal  of  a  representative  component  of  e,  namely  e' 
=  (du/dx)2  ~(du/dt)2,  normalized  by  its  mean.  Here,  u  is  the  velocity 
fluctuation  in  the  direction  x  of  the  mean  velocity  U.  In  writing  the  last 
step  of  the  above  approximation,  it  has  been  assumed  that  Taylor's 
frozen  flow  hypothesis,  namely  that  the  spatial  derivative  can  be 
approximated  by  the  temporal  derivative,  holds.  The  velocity 
fluctuation  u  was  obtained  by  a  hot-wire  mounted  on  a  pole  6  m  above 
the  ground  level  over  a  wheat  canopy.  The  microscale  Reynolds  number 
is  of  the  order  2500. 
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property  of  e  in  high-Reynolds-number  turbulence.  Its  understanding 
and  modeling  is  important  to  any  practical  scheme  for  computing 
turbulent  flows.  The  intermittency  has  important  implications  also  in 
contexts  such  as  the  structure  of  turbulent  flames. 

In  the  last  few  years,  much  work  based  on  multifractals  has  occurred 
on  the  description  and  modeling  of  the  intermittent  character  of  energy 
dissipation  rate  (and  other  similar  characteristics).  For  a  summary,  see 
[1],  In  Refs.  [2-4],  several  simple  quasi-deterministic  multifractal 
models  were  shown  to  describe  the  statistical  properties  of  the  energy 
dissipation  rate  quite  accurately.  Here,  we  provide  an  organized  basis 
for  developing  such  simplified  intermittency  models. 

The  energy  dissipation  rate  is  a  positive  definite  quantity  which  is 
additive  (in  the  sense  that  e  over  two  non-overlapping  intervals  equals 
the  sum  of  e  values  distributed  over  the  sum  of  the  two  intervals).  In 
this  sense,  it  is  convenient  to  think  of  e  as  a  measure  distributed  on  an 
interval. 


2.  MULTIPLIER  DISTRIBUTIONS 

Consider  a  long  data  string  of  e  distributed  over  an  interval  which  is  N 
integral  scales  in  extent,  N  being  some  large  integer.  Divide  the  interval 
into  'a'  equal-sized  sub-intervals,  and  obtain  the  ratios  of  the  measures 
in  each  of  the  sub-intervals  to  that  in  the  entire  interval.  These  ratios, 
to  be  called  multipliers,  are  clearly  positive  and  lie  between  zero  and 
unity.  Subdivide  each  sub-intervals  into  ’a’  pieces  as  before,  and  repeat 
the  procedure.  When  we  reach  sub-intervals  of  the  size  of  the  integral 
scale  of  turbulence,  L,  there  will  be  sufficiently  large  population  of  the 
ratios  Mj(L>,  l<i<N,  and  one  can  obtain  a  converged  histogram  of  the 
multipliers  Proceed  with  further  subdivisions.  At  the  n-th 

subsequent  level,  where  each  sub-interval  is  of  size  r/L  =  a  n,  there  are 
N.an  multipliers  Mj<r).  Construct  the  histogram  of  the  multipliers  at  each 
level.  Repeat  the  procedure  until  the  smallest  sub-interval  reached  is  of 
the  order  of  the  Kolmogorov  scale. 

The  thought  behind  this  hierarchical  construction  is  that  the  nonlinear 
processes  occurring  in  the  inertial  range  of  scales  may  be  abstracted  by 
a  breakdown  process  in  which  each  eddy  subdivides  into  'a'  pieces,  with 
the  energy  flux  redistributed  in  some  unequal  fashion  without  loss 
among  the  sub-eddies;  since  the  energy  flux,  as  it  cascades  down  to 
smaller  scales,  is  ultimately  converted  into  energy  dissipation,  the  two 
quantities  are  equal  on  the  average.  It  is  further  thought  that  this 
unequal  distribution  among  sub-eddies  is  the  heart  of  the  observed 
intermittency.  The  reality  is,  of  course,  more  complex.  For  instance,  it  is 
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not  obvious  what  the  appropriate  value  of  'a'  (that  is,  the  base  for  the 
cascade  process)  must  be,  or  whether  it  remains  the  same  from  one  step 
of  the  cascade  to  another.  In  spite  of  this  ambiguity,  it  is  clear  that  if 
there  is  a  scale-similar  breakdown  occurring  in  the  cascade  process,  the 
histograms  of  the  multipliers  should  be  identical  it  each  step  of  the 
subdivision  or  cascade. 

The  probability  density  P(M)  of  the  multipliers  M  -  here  and 
subsequently,  we  omit  the  indices  on  the  Mj<r>  and  denote  them  simply 
by  M  -  have  been  obtained  for  different  stages  of  subdivision  of  the 
interval.  Since  the  value  of  ’a*  is  not  known  a  priori ,  Chhabra  & 
Sreenivasan  [4]  obtained  P(M)  for  various  bases.  Figure  2  shows  the 
results  for  'a'  =  2,  3  and  5.  The  shape  of  each  of  the  distributions  is 
invariant  over  a  certain  range  of  scales,  suggesting  that  some  type  of 
self-similarity  occurs  in  this  scale  range,  whatever  the  assumed  base. 
This  range  of  scales  over  which  P(M)  is  self-similar  agrees  quite  well 
with  the  inertial  range  of  scales  determined  by  the  scaling  range  in 
spectra  and  structure  functions.  The  larger  symbols  show  an  average 
over  steps  involving  comparisons  between  boxes  of  size  'm'  and  those  of 
size  'm*a',  where  m  ranged  from  50  to  1000  in  units  of  sampling 
intervals.  (For  the  very  smallest  scales,  the  distributions  have  a  concave 
shape.  This  concavity  is  related  to  the  divergence  of  moments  [5]  and 
will  be  discussed  elsewhere.  For  very  large  box-sizes,  multiplier 
distributions  approach  a  delta  function  centered  around  0.5,  as  would 
be  the  case  for  random  measures.) 

The  scale-invariant  multiplier  distributions  obtained  in  figure  2  are 
fundamental  to  the  understanding  of  the  observed  multifractal  scaling 
[2].  One  can  compute  [4]  from  them  not  only  the  asymptotic  scaling 
properties  such  as  the  multifractal  spectrum  (or  the  f(a)  curve  [6])  of  a 
measure,  but  also  finite-size  fluctuations  of  scaling  properties  [4].  In 
addition,  even  in  instances  where  high-order  moments  diverge,  P(M) 
remains  well-defined.  Finally,  the  f(a)  function  may  extend  over  (-<»too) 
whereas  P(M)  is  a  compact  function  defined  on  M  e  [0,1]. 

A  disadvantage  of  P(M)  is  that  it  is  base -dependent.  However,  if  the 
cascades  giving  rise  to  the  observed  intermittency  are  randomly 
multiplicative,  then  the  multiplier  distributions  corresponding  to 
different  bases  are  related  by  convolution,  and  one  can  scale  out  this 
base-dependency  [4].  If  the  multiplicative  process  is  random  (i.e., 
successive  multipliers  are  uncorrelated)  several  base-independent 
functions  can  be  constructed  from  these  multiplier  distributions.  In 
particular,  for  any  two  bases  'a'  and  'b',  we  have 


log<(Ma)q>/log(a)  =  log<(Mb)q>/log(b)  =  ~[x(q)  +  Do] 


(1) 
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Figure  2:  Multiplier  distributions  P(M)  for  bases  (from  right  to  left)  'a'  = 
2,  3  and  5.  The  larger  symbols  show  averaged  multiplier  distribution, 
which  are  the  mean  of  multiplier  distributions  obtained  by  comparing 
measure  in  boxes  of  size  'm'  to  those  of  ’m*a'  ,  where  m  ranged  from  50 
to  1000  in  units  corresponding  roughly  to  the  Kolmogorov  scale.  The 
smaller  symbols  show  the  distributions  obtained  for  m  =  50,  80,  150, 
200,  400  and  1000.  The  solid  line  is  the  triangular  approximation  to  the 
binary  case.  The  figure  is  adapted  from  [4], 
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where  t(q)  =  (q-l)Dq,  Dq  being  the  so-called  generalized  dimensions  (7J 
of  q-th  order,  and  Do  -  Dq=o  is  the  fractal  dimension  of  the  support  of 
the  measure.  The  f(a)  function  can  also  be  easily  derived  from  the 
multiplier  distribution.  In  Ref.  [4],  it  was  shov*  that  the  f(a)  functions 
computed  from  these  different  distributions  were  in  good  agreement 
with  each  other  as  well  as  with  those  obtained  from  direct  methods 
such  as  box-counting.  This  agreement  indicates  the  existence  of  a 
probablistic  cascade  where  no  single  base  is  preferred. 

Incidentally,  a  good  approximation  for  P(M)  in  the  binary  case  is  the 
triangular  distribution  shown  in  figure  2.  In  Ref.  [4]  it  was  shown  that 
the  f(a)  function  for  this  model  is  in  excellent  agreement  with  that 
obtained  directly  [3].  Further,  the  triangular  model  displays  the  correct 
behavior  with  respect  to  sample-to-sample  fluctuations  in  f(a).  It  also 
reproduces  the  stretched  exponential  tails,  P(e)  ~  exp(-p(e)1/2),  observed 
in  Refs.  [3,8]  for  the  probability  distribution  of  e. 

3.  SIMPLE  MODELS 

The  multiplier  distributions  shown  in  figure  2  are  extracted  directly 
from  the  experiment  and  their  analytical  forms  are  yet  to  be  found 

from  the  theory.  The  question  meanwhile  is  a  simple  representation  of 

these  distributions  in  a  way  that  permits  one  to  evaluate  most  of  the 

measured  properties  quite  accurately.  The  goal  is  to  seek  models  that 

are  simple  enough  to  be  tractable  mathematically  and  realistic  enough 
to  represent  the  spirit  of  the  underlying  physics.  We  already  mentioned 
the  triangular  distribution  as  a  good  approximation.  An  even  simpler 
possibility  is  the  p-model  [2],  which  is  a  model  for  a  binary  cascade 
(’a'=2).  We  first  discuss  the  p-model  and  show  how  it  can  be  obtained  as 
a  rational  approximation  to  the  measured  multiplier  distribution  for  the 
binary  case.  We  will  then  discuss  how  models  in  the  same  spirit  can  be 
obtained  for  the  tertiary  case  ('a’=3).  The  limitations  of  the  procedure 
for  high  order  subdivisions  (’a'>3)  will  be  highlighted. 

From  a  physical  point  of  view,  the  cascading  process  with  ’a'=2  can  be 
thought  of  as  the  break-up  of  a  structure  (the  parent  structure  or  eddy) 
into  two  sub-structures.  For  the  one-dimensional  case  corresponding  to 
figure  1,  a  pertinent  question  is  the  following:  is  there  any  difference 
between  the  left  and  right  offsprings  in  terms  of  the  energy  flux  they 
receive  from  the  parent  structure?  One  can  determine  experimentally 
that  left  and  right  are  statistically  indistinguishable.  (This  is  not  true  for 
the  velocity  signal  itself,  as  can  be  concluded  from  Kolmogorov's  4/5 
law  [9]).  Now,  for  the  sake  of  simplicity  and  modeling,  let  us  assume 
that  one  of  the  two  sub-eddies  always  receives  a  fixed  fraction  p  of  the 


1  og2<Mq> 


4UI 


order  of  the  moment  q 

Figure  3:  A  comparison  between  moments  computed  from  the  measured 
multiplier  distributions  and  those  computed  for  the  different  models 
considered  in  the  text.  Experimental  data  were  obtained  from  a  record 
length  of  810,000  data  points.  The  convergence  of  moments  was 
reasonable;  for  example,  in  the  last  half  decade  of  the  record  length,  the 
variations  observed  were  smaller  (in  the  log  scale)  than  the  symbol  size. 
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energy  contained  in  the  parent  eddy;  naturally,  the  other  will  receive  1- 
p.  In  this  sense,  this  model  has  some  determinism.  However,  it  is  only 
quasi-deterministic  in  the  sense  that  either  one  of  the  two  eddies  could 
receive  the  fraction  p;  because  of  the  left-right  symmetry  mentioned 
above,  a  given  piece  will  receive  p  as  often  as  1-p.  Then,  the  multiplier 
distribution  for  the  p-model  becomes 


S(M-p)  +-  5(M-(l-p)) 
P’a'=2(M)  =  9 


(2) 


If  p=l/2,  there  is  no  intermittency  and  the  physical  situation 
corresponds  to  Kolmogorov’s  1941  theory  [10].  To  obtain  intermittency, 
we  should  have  a  value  of  p  different  from  1/2. 

How  can  we  choose  p?  A  natural  way  is  to  match  the  moments  of 
p'a'=2(M)  with  those  of  the  real  p(M).  For  both  distributions,  the  zero- 
order  moment  (normalization)  and  the  first-order  moment  (mean  value) 
coincide,  and  are  1  and  1/2,  respectively.  The  first  non-trivial  condition 
is  to  match  the  second  order  moment.  When  this  is  done,  we  obtain  the 
value  p=0.697,  or  l-p=0.303,  which  can  be  rounded  off  to  excellent 
accuracy  by  0.7  and  0.3,  respectively.  This  is  the  p-model  of  Ref.  [2].  It 
turns  out,  purely  by  luck,  that  high-order  moments  computed  for  the  p- 
model  also  agree  with  those  computed  for  the  real  data  (see  figure  3).  It 
had  been  shown  in  [2]  that  the  f(a)  spectrum  for  the  binary  p-model 
with  p  =  0.7  fit  the  experimental  data  quite  accurately. 

We  now  discuss  a  general  scheme  for  developing  for  all  ’a'*2  quasi- 
deterministic  models  of  the  sort  developed  above  for  the  p-model. 
Again,  we  attempt  to  do  this  by  matching  moments.  The  general 
multiplier  distribution  for  any  'a'  in  the  p-model  scheme  is 


l  S(M-Pi) 

P’a'(M)  =  - 

where 

£pi  =  l,  (0spi<l ) 
i=l 


(3) 


(4) 


We  may  now  equate  the  moments  of  P-a(M)  to  the  moments  of  the  real 
multiplier  distributions.  Since  multiplier  distributions  for  any  base  yield 
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order  of  the  cascade 


Figure  4.  Values  of  the  multipliers  for  the  binary  (diamonds)  and 
tertiary  (circles)  p-models.  The  letter  K  indicates  the  location  of  the 
multipliers  if  there  were  no  intermittency,  consistent  with  Kolmogorov's 
1941  theory  [10]. 
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the  same  results,  we  may  take  the  distribution  corresponding  to  the 
binary  cascade  of  figure  2.  Computing  the  moments  <M{1>  from  it,  we  are 
led  to  the  equations 

a 

^  p]  =  a  <Mq>lo8<a)/loS<2>,  q=l,  2,  ....  a.  (5) 

i=l 

This  is  a  system  of  'a'  equations  with  'a'  number  of  p;'s  to  be 
determined.  Using  Girard's  rule  [11],  it  is  easy  to  find  a  polynomial  of 
degree  ’a'  whose  roots  are  the  desired  pj's.  The  problem  thus  reduces  to 
the  determination  of  the  roots  of  the  polynomial.  It  turns  out  a 
posteriori  that  this  problem  has  physical  solutions  only  for  'a'=2  and 
’a’=3:  for  larger  values  of  ‘a’,  some  of  the  roots  turn  out  to  be  complex, 
and  have  no  physical  meaning.  The  values  of  pj  for  the  tertiary  cascade 
(’a'=3)  are  p^O.155,  p2=0.283  and  p3=0.562.  We  designate  this  as  the 
tertiary  p-model.  In  this  scheme,  the  classical  Kolmogorov  theory  would 
yield  pt=  p2=  p3=  1/3. 

The  values  of  p*  for  the  binary  and  tertiary  cascade  are  shown  in 
figure  4;  also  marked  by  K  are  the  classical  non-intermittent  values 
applicable  to  Kolmogorov's  1941  theory.  Although  the  binary  and 
tertiary  p-models,  respectively,  are  generated  to  possess  the  first  three 
and  four  moments  correctly,  it  is  remarkable  that  the  binary  and 
tertiary  p-models  and  the  measured  distributions  have  approximately 
the  same  high  order  moments  up  to,  say,  about  7  (see  figure  3). 

Other  models  have  also  been  proposed.  For  example,  Novikov  [12] 
proposed  a  uniform  distribution  for  P(M).  At  that  time,  however,  the 
multiplier  distribution  had  not  been  obtained  experimentally.  It  is  now 
clear,  however,  that  a  uniform  distribution  is  not  a  good  model  for  any 
of  the  curves  in  figure  2.  For  example,  a  good  approximation  to  the 
binary  cascade  is  the  triangular  distribution  shown  by  a  solid  line  in 
figure  2. 


4.  CONCLUSIONS 

The  multiplier  distributions  are  a  basic  tool  for  understanding  many  of 
the  scale-similar  properties  of  energy  dissipation  in  turbulence.  In  the 
absence  of  an  ab  initio  theory  that  yields  these  distributions  in  a 
deductive  way,  analytical  progress  can  be  made  only  by  modeling  them 
with  reasonable  schemes.  Here,  we  have  summarized  the  attempts 


made  in  the  last  few  years.  Below,  we  mention  some  applications  of  this 
work. 

The  first  application  is  in  the  calculation  of  Er,  the  energy  dissipation 
over  an  interval  of  size  r;  Er  =  rer.  According  to  the  definition  of  the 
multipliers, 

Er  =  EL  ft  mi,  (6) 

i=l 

where  n(r)=loga(r/L),  and  L  is  the  large-eddy  (integral)  scale.  Within 

the  p-model  scheme  for  a  binary  cascade,  the  probability  density 
function  (PDF)  for  the  ratio  x  =  Er/EL  is 


p(x)= 


A  T_ _ r(log2(r/L)+l ) _ 

x  L  T(k(x)+1)  r(log2(r/L)  -  k(x)+l) 


(7) 


where  A  is  a  normalization  constant,  T  is  the  gamma  function  and 

W¥x  _  lQg(x  (r/L)^og2(i:P)) 

log(p/(l-p)) 


(8) 


A  second  application  is  in  computing  the  exponents  £q,  defined  as 
<Erq>  ~  (r/L)£q  (9) 

The  result  for  the  triangular  distribution  is 

=  ~1og2{4  (1  -~[)  (q+1)(q+2)  )•  (10) 

Corresponding  results  for  the  p-model  are  given  in  [2].  The  PDFs  of  the 
velocity  increments  can  also  be  computed.  On  using  the  second  refined 
similarity  hypothesis  [13],  the  velocity  increments  can  be  written  as 


Au(r)  =  V  Eri/3. 


(11) 


In  Ref.  [14],  we  obtained  the  probability  density  of  V.  The  PDF  of  Au  can 
be  computed  if  some  model  is  assumed  for  Er.  In  Ref.  [8],  those  PDFs 

were  computed  using  the  binary  p-model. 
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The  tertiary  p-model  has  been  used  to  generate  a  signal  that  shares 
many  features  of  a  real  turbulent  velocity  trace.  This  issue  will  be 
addressed  elsewhere. 
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Abstract 

Fluid  flow  in  the  larger  arteries  is  essentially  unsteady;  under  normal  physiological 
circumstances  it  may  be  regarded  as  laminar.  To  obtain  more  insight  in  the  complex  blood  flow 
patterns  in  the  carotid  artery  bifurcation  and  around  the  aortic  valve,  finite  element  calculations 
have  been  carried  out,  validated  by  flow  visualization  studies  and  laser-Doppler  velocity 
measurements.  In  the  numerical  models,  the  arterial  wall  motion  and  leaflet-fluid  interaction  are 
taken  into  account,  using  a  quasi  1-D  linear  wave  propagation  model  and  a  fully  coupled 
iterative  method,  respectively. 


Introduction 

The  analysis  of  blood  flow  around  heart  valves  and  in  bifurcations  is  a  major  research  topic  in 
biomechanics.  Due  to  the  highly  complicated  geometry  of  the  arterial  system,  the  unsteadiness 
of  the  flow,  the  large  displacements  of  the  valve  leaflets,  the  deformability  of  the  vessel  wall 
and  the  non-Newtonian  behaviour  of  blood,  the  subject  is  a  challenge  for  the  engineering 
disciplines.  The  recent  progress  in  this  field  [1]  is  mainly  due  to  the  fact  that  the  experimental 
tools  and  computational  methods  are  so  advanced  now  that  unsteady  three-dimensional  flow 
analyses  in  complex  geometries  have  become  common  practice.  Besides,  the  software  and 
hardware  in  detection  apparatus  (like  ultrasound  and  MRI)  are  so  improved  that  non-invasive 
real-time  blood  flow  analyses  can  be  performed  clinically  in  the  near  future  and,  hence,  detailed 
fluid  mechanical  information  is  needed  to  interpret  the  clinical  data  [2].  The  present  state  of 
research  [1]  is  that  in  various  bifurcations  a  similar  steady  flow  behaviour  is  found,  comprising 
flow  separation,  secondary  flow  and  skewing  of  axial  velocity  profiles.  Differences  observed 
are  mainly  due  to  different  geometries  and  Reynolds  numbers.  However,  the  influences  of  flow 
unsteadiness,  wall  distensibility,  non-Newtonian  effects  and  stenoses  are  not  completely  clear 
yet. 


As  examples  of  the  interesting  flow  phenomena  in  the  cardiovascular  system  and  the  tools 
available,  we  will  focus  in  this  paper  on  the  physical-mathematical  methods  used  to  analyse  the 
entrance  flow  in  a  rigid-walled  curved  tube,  the  fluid  flow  in  a  distensible  model  of  the  carotid 
artery  bifurcation  and  the  interaction  between  fluid  flow  and  leaflet  motion  during  the  opening 
and  closing  of  the  aortic  valve.  The  geometry  of  the  carotid  artery  bifurcation  is  shown  in  figure 
1.  It  consists  of  a  main  branch, the  common  carotid  artery, which  asymmetrically  divides  into 
two  branches, the  internal  and  external  arteries.  A  characteristic  feature  of  the  carotid  artery 
bifurcation  is  the  widening  in  the  proximal  part  of  the  internal  carotid  artery,  the  carotid  sinus. 
As  a  second  example  the  aortic  valve  is  shown  in  the  same  figure.  It  is  one  of  the  four  valves 
controling  the  blood  flow  through  the  heart  and  is  situated  between  the  aorta  and  the  left 
ventricle.  It  consists  of  three  flexible  leaflets  and  behind  each  leaflet  a  cavity  in  the  aorta  wall  is 
present;  the  so-called  sinus  of  Valsalva. 


Figure  1.  Sketches  of  the  carotid  artery  bifurcation  and  the  aortic  valve. 

Under  normal  conditions  the  human  heart  contracts  and  relaxes  about  once  a  second.  The 
arterial  volume  flow  curve  consists  of  a  period  of  relaxation  (diastole)  followed  by  a  period  of 
contraction  (systole),  see  figure  6.  Due  to  the  contraction  of  the  heart,  blood  flow  is  essentially 
unsteady.  The  Strouhal  and  Reynolds  numbers  are  defined  as: 

Sr  =  Es&  ;  Re  =  5^  eq.l 

u  [L 

Here  u  is  the  mean  velocity  at  the  inlet  of  the  artery,  D  is  the  diameter  of  the  artery  and  co  is 
the  angular  frequency  of  flow  variation.  For  the  density  p  and  viscosity  p  of  blood  the  constant 
values  of  1050  kg/m5  and  3-4  mNs/m2  are  commonly  used,  although  in  regions  with  flow 
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separation  non-Newtonian  effects  may  play  a  role  [3],  In  the  present  study  the  blood  flow  is 
assumed  to  be  laminar,  although  in  some  arteries  the  peak  Reynolds  number  is  quite  high. 
However,  the  unsteadiness  of  the  flow  strongly  affects  the  critical  Reynolds  number  [4]. 
Finally,  the  arterial  walls  are  distensible.  This  results  in  the  occurrence  of  wave  phenomena; 
i.e.  pressure  and  flow  disturbances  travel  with  finite  velocity,  experience  damping,  and  will 
reflect  at  discontinuities  [5].  The  physiological  values  of  the  various  parameters  for  the  aorta 
and  the  carotid  artery  are  summarized  in  Table  1,  adapted  after  Caro  ct  a!.  [6j. 


Ascending 

aorta 

Carotid 

artery 

Diameter  D  [mm] 

15 

5.0 

Peak  blood  velocity  Umax  [m/s] 

1.2 

0.5 

Mean  blood  velocity  u  [m/s] 

0.2 

0.2 

Measured  wave  speed  c  [m/s] 

5.0 

8.0 

Reynolds  number 

750 

275 

Reynolds  number  (peak) 

4500 

700 

Strouhal  number 

0.95 

0.28 

Speed-ratio  Umax  /c 

0.24 

0.06 

Table  1.  Normal  values  for  canine  cardiovascular  parameters  at  a  heart  rate  of  2  Hz  [6]. 


Many  researchers  have  contributed  to  a  better  insight  into  the  flow  phenomena  around  the 
aortic  valve  and  in  the  carotid  artery  bifurcation.  Significant  contributions  to  the  understanding 
of  the  fluid  mechanical  aspects  in  aortic  valve  functioning  were  delivered  by  Henderson  and 
Johnson  [7],  Bellhouse  and  Talbot  [8],  Peskin  [9]  and  Wipperman  [10].  Extensive  calculations 
on  blood  flow  in  straight  arteries  were  performed  by  Womersley  [11].  Olson  [12]  concluded 
from  a  steady  flow  study  in  a  symmetrical  three-dimensional  bifurcation  that  the  flow 
phenomena  occurring  in  the  daughter  branches  are  highly  determined  by  curvature  effects.  Yao 
and  Berger  [13]  further  investigated  the  steady  entry  flow  in  a  curved  pipe.  Detailed 
experimental  and  numerical  information  about  the  flow  field  in  the  carotid  bifurcation  has  been 
obtained  by  Bahradvaj  et  al.  [14]  and  for  the  unsteady  case  by  Ku  et  al.  [15]  and  Perktold  [16]. 
In  our  group,  model  studies  of  the  flow  phenomena  around  the  aortic  valve  were  performed 
[17,18]  and  measurement  techniques  were  developed  for  oscillatory  flow  [19].  To  gain  insight 
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into  secondary  flow  patterns,  entrance  flow  in  a  90-degree  bend  was  studied  [20-22].  Next, 
steady  and  unsteady  flow  in  models  of  the  carotid  artery  bifurcation  were  analysed  [23-26]. 
Here,  we  will  briefly  describe  the  methods  used  and  present  some  characteristic  results. 
Experimental  validation  of  the  methods  used  is  reported  elsewhere  [27-30].  Finally,  some 
future  research  directions  will  be  pointed  out. 

Entrance  flow  In  a  90-degree  curved  tube 

The  flow  of  an  incompressible  isothermal  Newtonian  fluid  is  described  by  the  Navier-Stokes 
and  continuity  equations.  To  solve  these  equations  they  are  discretized  using  a  standard 
Galerkin  finite  element  method  which  is  implemented  in  the  finite  element  package  SEPRAN 
[31].  The  spatial  discretization  results  in  the  following  set  of  non-linear  first-order  differential 
equations  [32]: 


Mii + [S  +  N  (u)3  a  +  LT  p  =  f + b 

eq.2a 

icT 

n 

o 

eq.2b 

where  Mii  represents  the  local  acceleration  term,  Su  the  viscous  term,  N(y)u  the  convective 
acceleration  term,  LTp  the  pressure  gradient  term  and  Lu  the  velocity  divergence  term.  The 
body  and  boundary  forces  are  represented  by  f  and  fc  respectively,  while  a  contains  the  velocity 
and  p  the  pressure  unknowns  in  the  nodal  points.  The  local  time  derivative  in  eq.2a  is 
approximated  by  : 


jin+e  =  Mn>.L-  Un  eq.3 

At 

in  which  un  is  an  abbreviation  for  u(nAt)  with  At  the  time  step.  The  non-linear  convective  term 
is  linearized  using  one  step  of  a  Newton-Raphson  iteration  scheme.  In  order  to  reduce  the 
number  of  unknowns,  a  penalty  function  method  is  applied  [33].  For  the  solution  of  the  matrix 
equation  a  direct  technique  was  used . 

The  elements  used  belong  to  the  group  of  so-called  modified  Crouzeix-Raviart  elements.  For 
2D  situations,  like  the  aortic  valve  model,  a  7-noded  element  with  3  pressure  and  14  velocity 
unknowns  is  used.  3D  situations,  like  the  curved  tube  and  the  carotid  artery  bifurcation,  require 
the  application  of  a  27-noded  element  with  4  pressure  and  81  velocity  unknowns.  (See  figure 
2).  The  accuracy  is  0(Ax3)  for  the  velocity  and  0(Ax2)  for  the  pressure  with  Ax  being  a 
characteristic  dimension  of  the  spatial  discretization.  The  resolution  in  time,  using  0=1/2,  is 
shown  to  be  0(At2). 
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Figure  2.  The  7-noded  and  27-noded  elements  used  in  solving  the  2D  and  3D  fluid  motion 
equations,  respectively. 

The  flow  at  the  inlet  is  supposed  to  be  fully  developed.  The  velocities  at  the  wall  are 
presumed  to  be  zero.  At  the  outlet  the  normal  and  both  tangential  stresses  are  set  to  zero,  while 
in  the  plane  of  symmetry  both  tangential  stresses  and  the  normal  velocity  component  are  put  to 
zero. 

As  an  example  some  results  of  unsteady  entrance  flow  in  a  90-degree  curved  tube  will  be 
shown.  The  flow  rate  consisted  of  a  sinusoidally  varying  unsteady  flow  component 
superimposed  on  a  steady  flow  component.  With  a  curvature  ratio  of  8  =  1/6,  the  Dean  number 

(=  fs  Re)  varied  between  82  and  327  The  Strouhal  number  was  0.46.  In  figure  3  the  results  of 
axial  and  secondary  flow  at  8  =  22.5°  from  the  straight  inlet  tube  are  shown.  Axial  flow  is 
presented  by  axial  isovelocity  lines  and  secondary  flow  is  visualized  by  means  of  velocity 
vectors.  Contour  level  0  corresponds  to  zero  axial  velocity.  As  a  consequence  of  centrifugal 
forces,  the  secondary  velocities  near  the  plane  of  symmetry  are  directed  towards  the  outer  bend 
(O),  whereas  near  the  side  wall  of  the  curved  tube  these  secondary  velocities  are  directed 
towards  the  inner  bend  (I),  resulting  in  a  Dean-type  secondary  flow  field.  For  the  total  period 
of  time  the  center  of  this  secondary  vortex  is  situated  near  the  center  line  of  the  cross-sectional 
plane.  The  secondary  velocities  at  t=l/4T  and  1/2T  are  about  equal,  as  well  as  the  secondary 
velocities  at  t=3/4T  and  t=0.  At  minimal  flow  rate  a  region  with  low  secondary  velocities  is 
found  near  the  inner  bend.  As  a  consequence  of  secondary  flow  a  shift  of  the  maximum  of  axial 
velocity  towards  the  outer  bend  is  observed  for  the  whole  period  of  time.  This  shift  is  maximal 
at  minimal  flow  rate  (t=3/4T).  At  this  time  interval  a  region  with  negative  axial  velocities  is 
observed  at  the  inner  bend.  Similar  results  were  obtained  at  other  axial  positions  [22].  There  it 
also  appeared  that  axial  flow  is  highly  determined  by  secondary  flow.  The  influence  of  the 
frequency  parameter  and  the  flow  wave  form  appeared  to  be  of  minor  importance. 
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Fluid  flow  in  a  distensible  model  of  the  carotid  artery  bifurcation 

In  the  numerical  analysis  of  the  flow  phenomena  in  distensible  geometries  not  only  the  fluid 
motion  equations  but  also  the  equations  governing  wall  motion  should  be  taken  into  account. 
Motivated  by  the  small  ratio  of  fluid  velocity  over  wave  velocity  0.06  for  the  carotid  artery) 
an  uncoupled  approach  is  followed.  First,  the  time  dependent  pressure  distribution  due  to  wave 
propagation  is  determined.  Next,  the  wall  motion  due  to  this  time  dependent  pressure 
distribution  is  calculated.  Finally,  the  wall  motion  is  prescribed  as  a  boundary  condition  for  the 
solution  of  the  fluid  motion.  We  will  briefly  describe  the  three  steps  in  this  procedure  and 
finally  an  illustrative  result  will  be  given. 

The  time  dependent  pressure  distribution  is  calculated  using  a  quasi- ID  linear  model 
describing  wave  propagation.  The  linearized  one-dimensional  mass  and  momentum  equations 
for  unsteady  viscous  flow  through  a  visco-elastic  tube  are  used,  which  read  [34]: 


8A  3u 

—  +  Ao  —  =  0 
8t  dx 


P 


3u  3p 
—  +  —  =  f 

St  dx 


eq.4a,b 


In  these  equations  A  =  jtR2  is  the  cross-sectional  area  of  the  tube,  Aq  is  the  cross-sectional  area 
in  the  reference  state,  u(x,t)  and  p(x,t)  are  the  axial  velocity  and  transmural  pressure,  both 
averaged  over  the  cross-sectional  area,  and  f  is  the  so-called  friction  function.  The  assumption 
of  one-dimensionality  holds  if  the  wave  length  is  large  compared  to  the  diameter  of  the  tube; 
upto  the  fifteenth  harmonic  in  the  carotid  artery  tnis  ratio  is  larger  than  40.  Due  to  the  assumed 
linearity  the  equations  can  easily  be  solved  using  harmonic  functions.  To  that  end,  two 
additional  assumptions  have  to  be  made;  one  for  the  friction  function  and  the  other  for  the 
constitutive  relationship  between  A  and  p.  For  the  friction  term  a  frequency  dependent 
expression  is  used,  derived  from  the  two-dimensional  theory  of  unsteady  fully-developed  flow 
in  uniform  visco-elastic  tubes  [1 1].  Visco-elastic  wall  behaviour  is  taken  into  account  by  an 
experimentally  determined  constitutive  relationship  between  the  complex  amplitudes  of  pressure 
and  cross-sectional  area  variations.  The  influence  of  wave  reflections  is  incorporated  in  the 
model  by  treating  the  carotid  artery  bifurcation  as  a  bifurcation  of  uniform  tubes.  Due  to  the 
very  large  wave  length/diameter-ratio  the  contribution  of  local  inhomogeneities,  like  the  carotid 
sinus,  to  the  reflection  and  transmission  coefficients  appeared  to  be  negligible.  Reflections  from 
the  outflow  ends  of  the  internal  and  external  arteries  were  incorporated  in  a  standard  manner 
[29]. 

Next,  the  wall  motion  of  the  carotid  bifurcation  due  to  the  propagating  pressure  pulse  is 
analysed.  Because  of  the  magnitude  of  the  deformation  (relative  diameter  changes  up  to  10  % 
are  found)  the  system  to  be  analysed  is  geometrically  non-linear.  For  practical  reasons  the 
mechanical  behaviour  of  the  vessel  wall  is  described  with  a  Hookean  constitutive  relation.  The 
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matenai  parameters  are  selected  in  such  a  way  that  diameter  changes  similar  to  the  ones 
observed  in  vivo  are  obtained.  Visco-elastic  and  inertia  effects  are  neglected,  reducing  the 
problem  to  a  quasi-static  one.  Although  the  wall  of  the  artery  has  a  thickness  which  is  about  20 
times  smaller  than  its  diameter,  experiments  show  that  its  bending  stiffness  is  considerable 
[35],  These  considerations  lead  to  the  use  of  shell  elements;  compared  with  the  application  of 
three-dimensional  brick  elements  this  results  in  a  considerably  smaller  system  of  equations.  The 
finite  element  package  MARC  is  used  with  an  8-noded  thick  shell  element  with  global 
displacements  and  rotations  as  degrees  of  freedom  and  four  integration  points  [36].  Boundary 
conditions  for  the  wall  motion  are  applied  in  a  rather  pragmatical  way  [29]. 

The  final  step  in  the  numerical  approach  is  the  solution  of  the  fluid  motion  equations  using  the 
finite  element  method  described  before.  However,  in  this  case  for  each  time  step  the  mesh  is 
adapted  to  account  for  the  wall  motion.  As  inflow  condition  the  fully  developed  axial  and  radial 
velocity  profiles  as  given  by  Womersley  [11]  were  prescribed.  The  boundary  conditions  at  the 
wall  were  obtained  by  taking  the  time  derivative  of  the  wall  displacement.  At  both  outflow  ends 
boundary  conditions  of  the  stress-type  cn  =  -p  and  at  =  0  were  prescribed.  The  pressures 
needed  for  this  purpose  were  calculated  using  the  quasi-ID  model  for  wave  propagation.  To 
investigate  the  effect  of  the  wall  motion,  the  flow'  field  calculated  for  the  distensible  carotid 
artery  bifurcation  model  was  compared  with  the  flow  field  calculated  for  a  corresponding  rigid 
model.  The  geometry  of  the  rigid  model  was  chosen  to  be  equal  to  the  end-diastolic  geometry 
of  the  distensible  model.  In  both  cases  an  identical  flow  rate  at  the  entrance  was  prescribed. 

In  figure  4  the  flow  rates  at  the  outflow  of  the  rigid  model  are  compared  with  those  in  the 
distensible  model.  Due  to  the  increase  in  its  volume,  part  of  the  volume  flow  entering  the 
distensible  model  is  accumulated  during  acceleration.  During  deceleration  this  part  is  released. 
This  effect  accounts  for  the  difference  between  the  outflow  rates  in  the  rigid  and  the  distensible 
model.  As  an  illustration  of  the  results,  the  axial  flow  field  in  the  internal  carotid  artery  at 
minimal  flow  rate  (end-systole)  is  shown  in  the  same  figure.  Here,  contour  level  1  corresponds 
to  zero  axial  velocity;  a  reversed  flow  area  is  observed  at  the  non-divider  wall  (N).  At  the 
entrance  of  the  internal  carotid  artery  (10)  and  halfway  the  sinus  (II)  the  reversed  flow  area 
occupies  about  40  %  of  the  local  cross-sectional  area.  For  the  distensible  case  both  the  size  of 
the  reversed  flow  area  and  the  magnitude  of  the  negative  velocities  are  found  to  be  smaller.  In 
both  cases  Dean-type  secondary  vortices  are  observed  resulting  in  C-shaped  curvatures  of  the 
corresponding  axial  isovelocity  profiles.  In  the  rigid  model  the  curvature  is  more  pronounced. 
Most  likely,  the  release  of  "flow  accumulated  during  acceleration"  in  the  distensible  model  is 
the  dominant  factor  causing  the  differences  with  the  flow  field  observed  in  the  rigid  model  [29]. 

The  above  described  flow  field  in  the  carotid  artery  changes  cc  lerably  as  a  function  of 
time.  The  recirculation  area  is  found  to  develop  during  deceleration  and  to  disappear  during 
acceleration.  Also  the  secondary  flow  field  and  the  curvature  of  the  axial  isovelocity  lines  are 


distensible 


f  f  ■  f 


distensible 


Figure  4.  Comparison  between  axial  velocity  profiles  in  the  symmetry  plane  and  at  three- 
cross-sections  in  the  internal  carotid  artery  at  end-systole  (ES),  calculated  for  the 
distensible  and  rigid  models  [29]. 
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strongly  time-dependent.  In  general,  the  distensibility  of  the  wall  is  found  to  reduce  some  of 
these  effects,  although  the  global  structure  of  the  flow  field  remains  unchanged  [29],  However, 
this  conclusion  should  be  handled  with  care  as  no  comparison  has  been  made  between  the  3D 
pressure  field  with  the  pressure  distribution  calculated  using  the  quasi- ID  model  for  wave 
propagation.  Since  the  latter  was  applied  to  calculate  the  wall  motion,  it  should  not  deviate  too 
much  from  the  3D  pressure  field  if  a  consistent  solution  of  the  fluid  motion  is  to  be  obtained. 
This  will  be  a  future  research  topic. 

Fluid-leaflet  interaction  in  the  aortic  valve 

Due  to  the  strong  interaction  between  structure  motion  and  fluid  flow,  in  the  numerical  model 
for  the  dynamic  behaviour  of  the  aortic  valve  fluid  and  structure  are  iteratively  coupled.  The 
equations  of  motion  of  the  subsystems  are  evaluated  separately  and  an  iteration  is  performed 
until  convergence  is  achieved.  Because  of  computational  reasons,  the  fluid  flow  and  the 
structure  are  restricted  to  be  two-dimensional.  For  the  fluid  the  full  unsteady  two-dimensional 
incompressible  Navier-Stokes  equations  are  solved  by  means  of  the  standard  Galerkin  finite 
element  method  described  before.  For  stability  reasons,  the  time  integration  is  performed  with  a 
complete  Euler  implicit  scheme  (0=1  in  eq.3).  As  initial  condition,  the  steady  state  solution  for 
a  fully  opened  valve  is  taken.  As  a  boundary  condition  at  the  entrance,  a  time-varying  parabolic 
axial  velocity  profile  is  prescribed  and  the  radial  i  low  is  set  to  zero.  The  entrance  channel  is 
chosen  to  be  long  enough  to  guarantee  a  full  development  of  the  unsteady  velocity  profile.  Due 
to  the  lack  of  experimental  data  this  fully  developed  entrance  condition  is  used,  although  it 
certainly  does  not  correspond  to  the  physiological  one.  As  contact  condition  on  the  valve,  the 
fluid  velocity  is  set  equal  to  the  local  valve  velocity,  determined  from  the  actual  valve  position 
and  the  valve  position  at  the  previous  time  level.  At  the  outlet  a  stress-free  flow  condition  is 
prescribed.  At  the  channel  walls  a  no-slip  condition  is  used. 

To  incorporate  the  moving  structure  a  segmented  valve  is  considered,  consisting  of  several 
rigid  segments  connected  to  each  other.  The  physiological  relevance  of  this  model  valve  is  that 
with  an  increasing  number  (n)  of  links,  the  segmented  valve  will  behave  more  and  more  like  a 
fully  flexible  valve.  In  the  special  case  when  n=l,  the  valve  reduces  to  the  so-called  rigid  valve 
[37],  In  figure  5  some  segment  quantities  are  defined.  Each  segment  has  one  degree  of 
freedom:  its  angle  of  rotation  (f>j.  The  inertia  of  the  valve  is  neglected.  At  each  point  of  time  a 
segment  must  satisfy  the  equilibrium  conditions: 

4x  I  +  4x2  +  ffx  =  0  ;  4yl  +  4y2  +  4y  =  0 
4x2  *i  +  4y2  h  COS(Pi  +  mi  =  0 


eq.5c 
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Figure  5.  Definition  of  the  geometry  and  the  forces,  acting  on  a  segmented  valve. 


with  ff  the  fluid  force  and  fh  the  hinge  force  respectively.  I  is  the  length  of  the  segment.  Index  i 
refers  to  segment  i,  subscripts  x  and  y  refer  to  the  horizontal  and  vertical  directions,  whereas  1 
and  2  refer  to  the  left  and  right  edge,  respectively  m‘f  denotes  the  moment  due  to  the  fluid 
flow  about  the  center  of  the  hinge  with  segment  .-1.  The  fluid  forces  and  moments  must  be 
calculated  by  the  finite  element  method.  On  the  segment  edges  the  force  continuity  conditions: 

4x2  =  "4ixi  1  <hy2  *  '*hyi  eq.6a,b 


are  imposed.  The  boundary  conditions  for  segment  n: 

fhx2  =  0  ;  *hy2  =  0  i  m2 “  0 


eq.7a,b,c 


complete  the  set  of  equations.  For  an  arbitrary  set  (fr  (i=l,n)  the  system  is  not  in  equilibrium. 
For  those  states  a  residual  moment  m'res  per  segment  is  defined  as: 


m*reS  =  4ix2  sin(Pi  +  <hy2  li  c°S<Pi  + 


eq.8 


The  unknown  hinge  forces  can  be  calculated  in  a  recursive  way: 


fhxl  -  ■  4c  ;  4yl  -  4yl  *  4y 


eq.9a,b 


Starting  from  the  valve  tip,  where  boundary  conditions  (eq.7)  can  be  applied,  the  hinge  forces 
fhx  and  Fhy  can  be  calculated  for  every  segment  with  eq.9,  which  in  turn  can  be  substituted  in 
eq.8.  This  leads,  together  with  eqs.  5  and  6  and  expressions  for  the  fluid  moment  and  forces. 
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to  n  equations  with  unknowns  <pi  (i=i,n).  This  set  of  equations  is  solved  by  reformulating  the 
problem  as  a  non-linear  least  squares  problem.  To  this  end  a  residual  moment  is  defined: 


where  r  is  the  total  residual  moment  to  be  minimized  and  mi  the  moment  acting  on  segment  i, 
which  is  a  function  of  a  position  set  <p.  Once  a  minimum  of  r  is  found  which  equals  zero,  <p 
contains  the  equilibrium  position  of  the  valve  segments  and  therefore  of  the  valve  itself.  The 

segment  moments  mj  depend  on  all  <p0 . <pn-  In  general  no  explicit  relation  for  this 

dependence  can  be  given,  especially  not  when  the  finite  element  fluid  model  is  used.  A 
consequence  is  that  derivatives  cannot  be  calculated.  Therefore,  a  method  must  be  used  which 
does  not  require  the  evaluation  of  derivatives.  A  very  suitable  non-linear  least  squares  method 
for  this  study  is  Powell's  hybrid  method  [38].  It  combines  a  robust  convergence  far  from  the 
equilibrium  position,  with  a  fast,  superlinear  convergence  close  to  it.  Notwithstanding  the 
robustness  of  Powell's  method,  it  occasionally  fails  to  find  the  equilibrium  position.  In  these 
cases,  a  new  initial  estimate  is  generated  and  the  iteration  process  is  restarted.  Using  this 
method,  both  the  numbers  of  segments  (n=l  and  n=4)  and  the  physical  parameters,  like  the 
Strouhal  and  Reynolds  numbers,  are  varied. 

In  the  numerical  calculations  a  pulsatile  flow  rate  is  used.  Since  the  flow  rate  decleration  time 
is  an  important  parameter  in  the  valve  closure  mechanism  [17]  ,this  time  is  chosen  as  the 
characteristic  time  T.  Figure  6  shows  the  velocity  field  for  n=l,  Re=313  and  Sr=3.60.  At  t=0.0 
the  velocities  are  small  and  a  vortex  is  present  in  the  sinus.  At  t=1.0  the  fluid  is  accelerated  and 
the  valve  starts  to  open.  At  t=1.5  the  valve  has  reached  its  maximum  velocity  and  pushes  out 
the  contents  of  the  sinus.  On  t=2.0  the  flow  rate  is  maximal,  the  valve  is  slowing  down  but  still 
moving  towards  the  fully  opened  position.  Just  at  t=2.5,  when  the  flow  has  its  maximum 
deceleration,  the  valve  reaches  its  maximum  position  and  its  velocity  becomes  zero.  At  t=2.75 
the  valve  is  moving  towards  the  closed  position.  A  vortex  in  the  sinus  is  being  formed.  At 
t=3.0,  when  the  flow  rate  is  nearly  zero,  the  valve  has  its  maximal  closing  velocity.  At  t=4.0 
the  valve  has  nearly  reached  its  pseudo-equilibrium  position  and  its  velocity  becomes  zero.  It  is 
interesting  to  note  that  the  valve  moves  already  towards  its  closed  position  during  flow 
deceleration,  just  like  the  natural  aortic  valve  does.  This  observation  is  consistent  with  quasi- ID 
model  predictions  [17]. 

For  lower  Strouhal  numbers  the  motion  of  the  valve  is  found  to  be  nearly  in  phase  with  the 
flow  rate  and  its  amplitude  is  larger  [30].  The  influence  of  a  variation  of  the  Reynolds  number 
between  300  and  1 100  appeared  to  be  only  marginal.  Higher  Reynolds  number  values  could 
not  be  reached  as  the  fluid  solver  does  not  allow  computations  for  Re  >1 100.  Also,  similar 
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results  were  obtained  for  a  segmented  valve  with  n=4;  the  curvature  changes  of  the  leaflet  are 
only  modest.  Only  at  a  much  higher  Strouhal  number  or  when  a  harmonically  varying  flow  rate 
was  used,  large  deformations  of  die  valve  leaflet  were  observed.  Finally,  it  need  to  be 
mentioned  that  if  the  simulation  of  a  full  physiological  flow  cycle  is  required,  including  back 
flow  and  full  valve  closure,  the  model  must  be  extended  with  a  closure  algorithm,  which  is  not 
included  in  the  present  model.  This  and  the  extension  to  the  three-dimensional  case  will  be  a 
future  research  topic. 

Concluding  discussion 

From  the  present  study  it  is  concluded  that  the  finite  element  method  can  be  used  for  detailed 
analyses  of  unsteady  blood  flow  in  complex  three-dimensional  geometries.  The  local  geometry 
of  the  bifurcation  highly  affects  the  axial  and  secondary  flow  fields,  but  the  wall  distensibility 
has  only  a  marginal  influence.  During  flow  acceleration  and  deceleration  the  valve  leaflet  shows 
large  displacements,  which  induce  vortex  like  velocity  patterns  in  the  cavity  behind  the  leaflet 
As  presented  elsewhere,  a  good  agreement  is  found  between  the  numerical  and  experimental 
results  [27-jO]. 

Some  future  research  topics  in  the  analysis  of  fluid  flow  in  large  blood  vessels  are  [1 J: 

—  Improvements  are  needed  of  the  models  for  fluid-structure  interaction.  In  case  of  the 
analysis  of  the  influence  of  wall  distensibility  on  fluid  flow,  the  improvements  need  to 
include  a  proper  account  for  the  effects  of  reflections  originating  from  the  peripheral 
vascular  bed,  the  application  of  physiologically  relevant  boundary  conditions  in  the 
calculation  of  wall  motion,  the  use  of  a  model  describing  the  complex  mechanical 
behaviour  of  the  wall  and  an  experimental  validation  of  the  uncoupled  approach  to 
calculate  the  flow  field  in  such  a  distensible  geometry.  For  a  complete  analysis  of  the 
dynamic  heart  valve  behaviour,  a  three-dimensional  fluid  model  must  be  applied  and  in 
some  cases  valve  inertia  or  visco-elasticity  of  the  leaflet  material  has  to  be  incorporated. 
Furthermore,  attention  should  be  given  to  incorporate  the  possibility  of  full  valve  closure. 
Then  the  models  can  be  used  to  improve  the  design  of  existing  or  newly  developed  one- 
or  bi-leaflet  disc  type  or  flexible  leaflet  type  heart  valve  prostheses. 

—  The  influences  of  non-Newtonian  effects  and  minor  stenoses  on  the  local  flow  field  are 
not  completely  clear  yet  Especially,  the  visco-  elastic  properties  of  the  blood  may  require 
a  considerable  effort.  For  the  analysis  of  the  influence  of  minor  stenoses  on  the  flow  field 
spectral  element  methods  and  PIV-methods  may  be  favourable,  as  detailed  analyses  have 
to  be  performed  of  eddy  generation.  Besides,  the  influence  of  biological  variabilities  (like 
bifurcation  angle  and  divider  geometry  in  case  of  a  bifurcation)  need  research  attention. 

—  Atherosclerotic  plaque  formation  is  found  at  positions  where  low  wall  shear  rates  occur 
[39].  Here  focal  regions  of  elevated  permeability  to  macromolecules,  temporary  gaps  and 
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a  thin  endothelial  glycocalyx  are  found.  Modelling  of  the  arterial  wall  is  worthwhile, 
focussing  on  maximal  stress  concentration  in  the  intima  and  the  mass  transport  of 
atherogenic  proteins  between  plasma  and  vessel  wall  [40].  Then  probably  an 
improvement  can  be  reached  with  regard  to  the  understanding  of  the  fluid  mechanical 
contribution  to  the  process  of  atherogenesis. 
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Abstract 

Transonic  flows  are  now  as  before  fascinating  and  very  important  in  respect  to 
theory,  experiment  and  applications.  In  the  early  days  analytical  methods  prevailed  that 
led  to  fundamental  results.  Now  a  days  due  to  increasing  power  of  computers  numerical 
methods  dominate.  Validation  problems  of  Euler-  and  Navier-Stokes  Codes  are  of 
primary  interest.  This  led  to  simple  but  fundamental  experiments  (DFVLR-F5  wing, 
vortex  flow,  shock  boundary  layer  interaction,  condensation  phenomena, ... )  and  the 
comparison  with  theory.  The  status  quo  of  validation  is  today  not  sufficient  in  all  respects. 
There  are  complex  dependences  on  the  block  structures,  the  meshes,  the  laminar- 
turbulent  transition  and  the  turbulence  models.  Some  future  trends:  Search  for  reliable 
codes  for  3D-boundary  value  problems  to  develop  ai  rfoi  Is  of  low  drag  -  especially  laminar 
transonicairfoils;  study  of  diabatic  flows  of  phase  changing  media  with  several  applications 
(lift  and  drag  of  airfoils,  steam  turbines,  cryogenic  tunnel,  Organic  Rankine  Cycle, ... ); 
transonic  flows  in  retrograde  media  (e.g.  BZT  fluids). 


1 .  INTRODUCTION:  WHAT  IS  A  TRANSONIC  FLOW? 

In  transonic  flows  the  magnitude  of  the  fluid  velocity  is  comparable  with  the  sonic 
speed  (=  signal  velocity).  The  body  moves  nearly  with  the  same  velocity  with  which 
disturbances  propagate.  The  theoretical  and  experimental  investigation  of  these  flows  is 
now  and  as  has  been  before,  of  great  interest  and  has  become  of  considerable  technical 
significance.  Nowadays  there  are  many  applications  in  aerodynamics  of  transonic  flight 
for  instance  in  the  field  of  civil  aviation  and  thermal  fluid  machinery  which  underline  even 
the  economical  importance  of  these  investigations.  The  declared  aim  is:  reliability,  high 
capacity,  less  drag,  less  noise,  less  pollution. 

The  typical  qualities  of  these  flow  fields,  especially  the  mathematical  and  physical 
problems  involved,  can  already  be  understood  by  studying  slender  body  flows  (Fig.  1 ). 
In  transonic  slightly  subsonic  flow  Ms<1  a  local  supersonic  region  develops  in  the 
neighbourhood  of  maximum  thickness  which  as  a  rule  is  limited  downstream  by  a  normal 
shock  (Fig.  2).  The  local  Mach  numberdistribution  on  the  airfoil  explains  the  flow.  In  front 
of  the  body  the  flow  decelerates,  behind  the  stagnation  point  acceleration  to  supersonic 


Sonicline 
Machline  ^ 


Shock 


ft  Figure  2.  Schlieren  picture.  Rough  surface. 
Mach  lines  and  shock.  Supersonic  region. 

<=  Figure  1.  Fluid  fields  and  Mach  number 
distributions  on  the  profiles, 
a)  <  1 ,  b)  M_  ->  1 ,  c)  M_  >  1 . 


velocity  exists.  In  the  shock  the  velocity  jumps  to  subsonic  velocity  with  following 
postexpansion  (Figs.  3,4).  After  this  deceleration  to  backward  stagnation  point  with 
following  approach  to  conditions  at  infinity.  In  transonic  slightly  supersonic  flow  >  1 
the  shock-wave  detaches  as  a  rule.  Between  shock  and  body  a  local  subsonic  region 
builds  up.  The  displacement  effect  of  the  body  leads  to  an  acceleration  to  supersonic 


Figure  3.  Black-white  Mach-Zehnder 
interferogram  of  transonic  flow  at  a  curved 
surface.  M  =  local  Mach  number. 


Figure  4.  Shock  boundary  layer 
interaction  at  an  airfoil. 


velocity  till  the  trailing  edge  shock  is  reached.  The  limit  Machline  is  the  last  characteristic 
from  the  airfoil  that  reaches  the  subsonic  region,  while  the  last  characteristic  running 
down  from  the  subsonic  domain  limits  the  range  of  influence  downstream.  M^-*  1  leads 
to  the  limiting  case  of  sonicvelocity  at  infinity.  The  sonic  line  extendstoinfinity.  At  the  body 
the  flow  accelerates  until  the  rear  shock  is  reached.  Comparing  the  Mach  number 
distributions  on  the  airfoil  we  see,  that  only  slight  changements  exist;  the  flow  freezes. 
Asimpieexplanationisasfollows:  if  is  only  slightly  above  onethe  shock  wave  stands 

far  upstream  with  Mach  number  behind  the  shock  M^5 1 .  That  means  the  profile  don't 
realize  whether  is  stands  in  a  transonic  supersonic  or  subsonic  flow.  With  other  words 
the  local  dates  at  the  profile  change  from  M^S  1  to  Mm  >1  only  slightly. 

The  mathematical  description  leads  even  for  slender  bodies  with  small  disturbance 
theory  to  quasilinear  partial  differential  equations.  On  the  sonic  line  (sonic  surface)  the 
type  changes,  across  the  shock  curve  (shock  surface)  the  shock  relations  have  to  be 
fulfilled.  Both  curves  (surfaces)  are  not  known  a  priori  and  have  to  be  determined  with  the 
solution. 


2.  TRENDS  IN  THE  PAST 

The  great  technical  importance  of  transonic  flows  on  the  one  hand  and  the 
interesting  mathematical-physical  problems  on  the  other  hand  had  led  to  an  extensive 
theoretical  and  experimental  treatment  of  relevant  problems  in  the  past  50  years.  The 
Proceedings  of  IUTAM  Symposium  Transsonicum  1 1 962  (Aachen)  [1  ],  I1 1 975  (Gottingen) 
[2],  III  1 988  (Gottingen)  [3]  give  an  interesting,  unique  survey  of  the  worldwide  efforts  until 
today.  Concerning  theory,  1 962  the  analytical  methods  prevailed.  Among  others  we  have 
the  hodograph  method,  the  parabolic  (local  linearization)  method,  the  characteristics 
method  and  the  integral  equation  method.  A  lot  of  general  results  came  out  like  similarity 
laws,  equivalence  rule,  area  rule,  asymptotic  expansions,  ...  Beside  this  some  exact 
solutions  for  special  profiles  had  been  discovered.  The  small  power  of  computers  at  that 
time  didn't  allow  the  numerical  treatment  of  the  direct  problem.  Already  1 975  the  interest 
changed  remarkable  to  Computational  Fluid  Dynamics  (CFD).  The  increasing  power  of 
computers  allowed  for  the  first  time  the  direct  numerical  treatment  of  the  nonlinear 
differential  equations  in  the  fluid  field.  Questions  of  stability,  convergence  and  estimation 
of  errors  were  more  or  less  open  this  time.  This  changes  drastically  in  the  following  years. 


3.  TRENDS  TODAY  ILLUSTRATED  BY  DIFFERENT  CODE  VALIDATIONS 

The  Proceedingsof  the  Symposium  Transsonicum  III  [3]  describe  thisdevelopment. 
The  concluding  remarks  by  Hornung  give  a  (subjective)  pointed  description  worth 
reading  of  the  status  quo  with  the  progress  achieved  but  with  open  questions  too. 
Concerning  the  CFD-methods  the  validation  of  Euler-  and  Navier-Stokes  Codes  are  of 
primary  interest.  This  led  to  some  simple  but  fundamental  experiments,  e.g.  the  DFVLR- 
F5  wing  experiment  (Fig.  5)  and  the  vortex  flow  experiment  (Fig.  6)  and  the  comparison 


primary  vortex 


Figure  5.  Test  airfoil  DFVLR-F5  in 
the  transonic  1  m  x  1  m  test  section 
in  Gottingen. 


Figure  6.  Streamlines  of  the  primary  and 
secondary  vortex  (M_=  0.85,  a=  10°). 


with  numerical  calculations  for  these  cases.  Some  other  experiments  and  validations  will 
be  discussed  later  on. 

In  principle  there  are  two  concepts  for  the  numerical  calculation  of  the  viscous  flow 
around  wings.  By  using  the  zonal  numerical  approach  in  different  domains  of  the  flow 
field  different  differential  equations  according  to  the  physical  effects  will  be  used  (Fig.  7). 
Outside  the  boundary  layer  in  front  of  the  shock  we  use  the  potential  equation,  behind 
the  shock  the  Euler  equations,  near  the  body  the  boundary  layer  equations,  near  the 
trailing  edge  and  in  the  wake  region  the  Navier-Stokes  equations.  The  shock-boundary 
layer  interference  is  given  by  a  separate  local  solution  which  has  to  be  integrated 
iteratively  in  the  global  flow  field.  The  second  method  solves  the  full  Navier-Stokes 
equations  in  the  total  flow  field.  The  main  point  is  to  choose  a  suitable  block-structured 
grid  which  secures  ahigh  resolution  in  the  viscous  interaction  region  and  at  the  shock  and 
the  trailing  edge.  Within  the  inviscid  flow  field  zones  a  reduced  resolution  is  possible.  This 
concept  needs  an  optimization  in  the  choise  of  computational  grids  and  blocks. 


Figure  7.  Phenomena  and  zonal 
model  equations  for  transonic  flow 
at  an  airfoil. 


A  workshop  was  organized  by  Kordulla  and  Sobieczky  [4]  before  the  Symposium 
Transsonicum  III.  A  well-defined  boundary  value  problem,  the  flow  overawing  (DFVLR- 
F5)  in  a  wind  tunnel  was  formulated  (Figs.  8,  9).  Boundary  conditions  were  given  at  the 
walls  and  in  the  entrance  and  exit  area.  All  groups  and  institutions  that  had  developed 
suitable  Navier-Stokes  programs,  were  invited  to  participate  in  solving  this  flow  problem 
numerically.  Active  participants  came  from:  NASA  Ames,  NASA  Langley,  NAL  Japan, 
FFA  Sweden,  MBB  Munich,  DFVLR  Gottingen.  All  theoretical  results  were  collected, 
compared  with  each  other  and  with  the  experiments  performed  in  Gottingen.  This  seems 
to  be  a  milestone  and  should  be  continued  in  future. 


A  -  A 


Figure  8.  3D  -  test  airfoil  [4], 


INLET  PLANE 


Figure  9.  Wing-plate 
configuration  in  1m 
x  1m  transonic  wind 
tunnel  [4], 


Figure  1 0.  Chordwise  c  -  distributions  on  the  wing  surface  in  different  sections  (1 ,4,7,9). 
Experimental  data  given  by  symbols,  solid  lines  theoretical  results(Schwamborn).  Flow 
conditions:  Free-stream  Mach  number  M_=  0.82,  a=  2° ,  Re/m  =  107  [4], 


Figure  11.  Comparison  of  predicted 
skin  friction  line  patterns  for  the 
wind  tunnel  test.  a=  2° .  Left:  lower 
wing  surface.  Right:  upper  wing 
surface.  [4], 
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Figure  12.  Lift  and  drag  coefficients  of  the  DFVLR-F5  Navier-Stokes  solutions  [4], 


The  pressure  distributions  on  the  wing  and  in  the  field  agree  quite  well  among  the 
participants  and  with  the  experiment  (approximately  10%).  The  results  are  not  very 
sensitive  (Fig.  1 0).  The  skin  friction  lines,  especially  if  separation  exists,  are  much  more 
sensitive  (Fig.  1 1 ).  Disappointing  is  the  scatter  of  the  lift  (cL)  and  the  drag  coefficients 
(cD)  (Fig.  1 2).  There  is  a  remarkable  and  complex  dependence  on  the  nature  of  the  used 
meshes  and  block  structures  in  streamwise  and  chordwise  direction  (C-O,  C-H,  0-0,  H- 
H-type)  and  the  discretization  (FVM,  FDM).  Beyond  this  the  laminar-turbulent  transition 
(switch  on)  and  the  used  turbulence  model  (Baldwin-Lomax)  may  be  of  consiuerable 
influence.  This  is  doubtless  a  disillusion  and  not  recognized  with  pleasure  but  it  is 
important  for  all  further  investigations  in  that  field.  Without  strict  validation  of  the 
numerical  codes  no  reliability  can  be  guaranted! 


3.2 

The  next  validation  complex  is  experiment  and  computation  of  transonic  viscous 
channel  flow  with  ventilation.  Many  efforts  at  different  places  had  been  done  [5, 6, 7].  The 
practical  idea  is  the  reduction  of  shock  strength  by  passive  control  in  the  domain  of 
turbulent  shock-boundary  layer  interaction.  Figure  13  describes  on  the  one  hand  the 
scheme  of  the  test  section  in  a  laboratory  and  on  the  other  hand  the  different  zonal 
methods forthe  numerical  approach.  Outsidethe  boundary  layerwe  have  the  nonviscous 
flow  part.  We  solve  the  potential  equation  after  suitable  grid  generation  by  well  known 
difference  methods.  At  the  wall  we  start  with  laminar  boundary  layer,  after  transition  we 
continue  with  turbulent  boundary  layer.  In  the  shock  boundary  interaction  area  we  fit  in 
an  analytical  model  [8].  In  the  cavity  closed  with  a  punched  sheet  we  prescribe  a  constant 
pressure  and  fulfil  the  conservation  of  mass.  Downstream  we  have  a  turbulent  boundary 
layer  with  ot  without  separation.  An  inverse  integral  method  has  to  be  applied  [9].  The 


Transonic  Wind  Tunnel 


channel  wall 


laminar  boundary  layer  turbulent  boundary  layer  in  front  of  the  interference  area  turbulent  boundary  layer  behind  the  interference  area 
(direct  integral  method)  (direct  integral  method)  (inverse  integral  method) 


Figure  13.  Zonal  methods  for  computation  of  transonic  viscous  channel  flow  with 
ventilation. 


porosiiy  =  8.2  % 
cavity  length  =  50  mm 


Figure  14.  Comparison  of  theory 
and  experiment  for  ventilation. 
Porosity  =  8.2%,  cavity  length  = 
50  mm.  The  mass  flux 
corresponds  to  Mmax=1.2  in  the 
case  without  ventilation  [6], 


Figure  15.  Interfe-ogram  without  and  with  Figure  16.  Cascade  in  supersonic  flow, 
ventilation  and  the  corresponding  Mach  Top:  Reflexion  of  bow  shock  wave  at 
number  at  the  wall  [7].  the  profile.  Bottom:  Bow  shock  wave 

doses  the  supersonic  region. 


calculation  and  the  measurements  for  Mach  number  and  skin  friction  at  the  lower  wall  and 
the  displacement  -  and  momentum  thickness  agree  well  (Fig.  1 4).  The  comparison  with 
the  nonventilated  case  is  convincing.  The  shock  strength  is  considerably  reduced  (Fig. 
15).  For  a  single  profile  the  wave  resistance  decreases  -  until  50%  -  and  the  shock 
position  is  fixed.  It  is  remarkable  that  the  effect  of  passive  control  is  already  reached  with 
a  perforation  of  about  8%.  Further  on  the  inclination  of  the  perforation-channels  is  of 
considerable  influence  (oblique  perforation). 

The  method  described  here  may  be  of  interest  for  several  applications.  For 
compressor  cascades  in  supersonic  flow  for  instance  it  is  possible  to  reduce  the  loss  by 
passive  control  on  the  suction  side  of  the  blades  (Fig.  16).  Experiments  in  a  cascade 
tunnel  [10]  and  in  channel  test  section  obviously  show  this  effect.  Calculation  with  zonal 
methods  are  possible  in  a  way  we  have  described  above. 

In  the  last  time  another  idea  of  passive  control  has  been  propagated  [11].  The  cavity 
is  closed  by  an  elastic  membrane.  Depending  upon  the  elasticity  and  the  inner  pressure 
the  membrane  builds  up  some  kind  of  adaptive  wall  that  may  have  a  favourable  influence 
on  the  shock-boundary  layer  interaction. 


3.3 


A  very  interesting  new  experiment  tor  validation  of  modern  transonic  numerical 
methods  is  the  condensation,  or  more  general  the  study  of  flows  with  phase  transition, 
in  Laval  nozzles  and  around  profiles.  This  is  important  for  several  reasons.  On  the  one 
hand  this  is  a  diabatic  flow  with  interesting  new  physical  effects  on  a  microscopic  and  a 
macroscopic  scale  (nucleation  theory,  droplet  growth),  on  the  other  hand  there  are 
several  actual  applications  in  steam  turbines  and  in  cryogenic  tunnels,  not  to  forget  the 


Figure  17a,b.  Schlierenpictures  and  calculated  iso  Mach  lines.  Left:  4>0=  29%,  subcritical 
heat  addition;  Right:  <{>Q  =  60%,  supercritical  heat  addition. 


Figure  18.  Diabatic  flow  in  a  hyperbola 
nozzle.  On  the  axis:  Nucleation  rate  J. 
Condensing  mass  g/gmax .  Pressure  ratio 
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Figure  19a, b.  Changementofthepressure 
drag  for  circular  arc  profile  (CA-0.1) 
chordlength  c  =  80  mm  and  NACA-0012 
profile  {c  =  100  mm)  by  heat  addition  due 
to  homogeneous  condensation,  without 
incidence  [12]. 


ftigh  of  an  aircraft  in  humid  air.  Condensation  gives  rise  to  heat  release  that  influences 
the  pressure  distribution  considerably.  Strong  changements  of  lift  and  wave  drag 
coefficients  are  possible  in  moist  atmospheric  air  [1 2].  The  numerical  method  solves  the 
unsteady  2D-Euler  equations.  Iteration  is  finished  if  the  stationary  final  solution  is 
obtained.  All  equations  are  used  in  conservation  form  and  the  condensation  is  modelled 
by  two  successive  processes.  For  homogeneous  condensation  we  start  with  the 
nucleation  theory  by  Volmer  [1 3]  and  continue  with  the  droplet  growth  theory  according 
to  the  Hertz-Knudsen  model.  The  calculation  uses  a  finite-volume  method  that  generalizes 
a  corresponding  procedure  for  adiabatic  flows  [14].  Depending  upon  the  stagnation 
humidity  <j)0  we  find  different  structures  of  condensation  phenomena  in  the  supersonic 
part  of  a  nozzle  -  subcritical  and  supercritical  heat  addition  (Fig.  17a,b)  [15].  The 
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NoooquHitrium 

Equilibrium 

Ac0  «  -12% 

adiabatic  ;  Ac„  «  207% 

Acl  -  -30% 

Ac,  -  30% 

Figure  20.  Non¬ 
equilibrium,  adiabatic 
and  equilibrium  flow  for 
NACA-0012  profile,  a  = 
1.25°  ,  Mf  =  0.8,  AMf  = 
0.015(16). 


comparison  between  calculation  and  measurements  is  convincing.  This  is  valid  in 
respect  to  the  Schlierenpictures  with  iso-Machlines  and  in  respect  tc  the  static  pressure 
measurements  along  the  axis.  Figure  18  gives  as  microscopic  quantity  the  nucleation 
rate  J  and  as  macroscopic  values  the  condensed  mass  g/gmax  and  the  static  pressure 
p/p01 .  Concerning  profile  flow  figure  1 9  shows  for  circular  arc  (CA-0. 1 0)  and  NACA-001 2 
profile  the  results.  Depending  upon  the  stagnation  humidity  <{>0  the  pressure  drag  may 
be  considerably  reduced  (CA-0. 1 0)  or  increased  (N ACA-00 1 2)  [1 2).  A  simple  explanation 
is  as  follows:  For  circular  arc  profile  and  adiabatic  flow  the  shock  position  is  in  the 
neighbourhood  of  the  rear,  for  M(o  =  0.87.  Condensation  will  not  move  this  shock.  The 
changement  of  the  pressure  due  to  heat  addition  leads  to  decreasing  drag  with 
increasing  humidity.  For  NACA  profile  on  the  contrary  the  shock  position  in  adiabatic  flow 
is  in  the  middle  of  the  profile,  for  Mf  0.8.  Here  due  to  condensation  the  shock  moves. 
The  changement  of  the  pressure  due  to  heat  addition  leads  to  an  increasing  drag  with 
increasing  humidity.  Remarkable  is  the  typical  double  shock  structure  for  high  humidity! 
The  first  is  due  to  super-critical  heat  addition,  the  second  is  the  usual  gasdynamic  shock. 
If  unsymmetric  flows  with  incidence  are  considered  things  change  once  more  [1 2].  Now 
for  NACA  profile  the  shock  is  for  adiabatic  flow  in  the  neighbourhood  of  the  rear  and  drag 
and  lift  decrease  with  increasing  humidity.  Figure  20  shows  the  different  effects  between 
non-equilibrium  condensation  and  equilibrium  condensation  on  the  one  hand  and 
adiabatic  flow  on  the  other  hand.  The  case  of  equilibrium  condensation  corresponds  to 
atmospheric  flight  conditions  with  chordlength  c  =  1 500  mm.  A  drastic  increase  of  the 
drag  is  the  consequence. 
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4.  FUTURE  TRENDS 

The  last  examples  lead  already  to  efforts  in  future.  In  the  field  of  CFD-methods  now 
as  before  reliable  Euler-  and  Navier-Stokes  codes  for  actual  problems  of  modern 
aerodynamics  are  wanted.  Validation  is  necessary  in  any  case.  Of  special  interest  are 
airfoils  of  low  drag.  This  aim  may  be  reached  by  design  of  shockless  profiles  [17]  or  by 
active  or  passive  control  of  the  shock  boundary  layer  interaction. 

The  transonic  laminar  airfoil  is  simply  the  optimum.  If  it  is  possible  to  suppress  the 
instabilities  in  the  boundary  iayer  and  keep  it  laminar  over  the  main  part  of  the  airfoil  this 
would  be  a  great  success. 

Phase  transition  effects  are  of  increasing  importance  as  we  have  seen.  The 
situation  in  cryogenic  tunnels  belongs  to  these  subjects  and  the  study  of  condensation 
of  N2  is  of  great  interest  [1 8,1 9].  A  possible  application  of  phase  changing  fluid  flow  lies 
in  a  new  type  of  fluid  flow  machines,  using  a  so  called  Organic  Rankine  Cycle  (ORC). 
Some  new  investigations  show  that  remarkable  features  exist  which  lead  to  large 
efficiencies  of  these  systems  [20]. 

Last  not  least  the  transonic  flow  in  retrograde  (dense)  fluids  is  in  many  respects 
surprising  [2ij.  Some  well  known  classical  gasdynamic  effects  invert  [22,23,24].  For 
instance,  expansion  shocks  may  be  possible  with  a  favourable  influence  on  boundary 
layer  separation;  isentropic  compression  of  these  media  may  lead  to  condensation  and 
so  on. 

At  Symposium  Transsonicum  III  Earl  Murman  gave  a  delightful  comment:  "What  we 
are  doing  is  fun,  it  might  even  be  importanr.  After  working  in  the  fascinating  field  of 
transonic  flow  for  over  35  years  I  am  convinced  that  it  is  important. 
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